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THE RELATIONSHIP OF ANXTETY LEVEL TO 
PERFORMANCE IN SERIAL LEARNING 


JANET A. TAYLOR 
Northwestern University 


A number of recent studies. (3, 4, 6) 
have been concerned with an attempt 
to study the effects of different. levels 
of motivation on performance in a 
variety of learning situations. In the 
first of these studies Taylor (6) pro- 
posed the hypothesis that (a) the total 
effective drive strength (D) of a sub- 
ject was, in part, a function of the 
level of internal anxiety or emotionality 
of the subject, and (b) the level of this 
internal emotional state would, in 
turn, be reflected by the responses 
made in a test of so-called manifest 
anxiety. This latter test consisted of 
items taken from the Minnesota Mul- 
tiphasic Personality Inventory that 
had been judged by clinical psycholo- 
gists to be indicative of a carefully 
defined behawior syndrome; manifest 
anxiety. The effect of different drive 
levels on performance in classical eye- 
lid conditioning was thén studied by 
comparing the conditioning curves of 
two groups of Ss, one of which had 
scored at the high end of the scale and 
the other at the low end. ‘The results 
showed that the anxious (high drive) 
group was markedly superior to the 
nonanxious (low drive) group. in the 
amount of conditioning exhibited. A 
subsequent study by Spence and Tay- 
lor (5) confirmed this finding. 





AND 


61 


KENNETH-W. SPENCE 
State University of Iowa 


On the assumption that scores on 
the manifest anxiety scale reflect a dif- 
ference in D, the findings of these two 
studies are in line with Hull’s theoret- 
ical formulation (1) that response 
strength (R) will be a function of exci- 
tatory potential (£), which is in turn 
a function of D and a hypothetical 
learning construct //. More com- 
pletely the theoretical formulation 
may be represented as follows: 


R = f (E) 

E = f (H-X< D) 

D = f (Anxiety score) 

IT = { (No. conditioning trials) 


It is obvious that the higher the 
anxiety score the stronger will be the 
excitatory potential and the: greater 
the response strength. It should be 
noticed, however, that the theoretical 
implication of increasing drive level of. 
S will depend upon the nature of the 
behavior situation. In situations in 
which there is but a single response 
tendency, increase in the strength of 
D results in a higher value of & and 
hence implies a stronger response. 
This is the state of affairs in classical 
conditioning, in which, by virtue of 
the control of the stimulus conditions, 
there is but a single response. In more 
complex learning situations such as 
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trial-and-error or selective learning, 
however, in which there are a number 
.of alternative, competing stimulus- 
response tendencies, the effect of in- 
creasing drive. strength will depend 
upon this initial response hierarchy 
and the relative habit strength of the 
correct (goal-attaining) response in 
. the hierarchy. 
Under certain conditions, particu- 
larly when the habit strength of the 
the correct response is weaker than 
those of the competing responses, rais- 
ing of the drive level would be expected 
_to have the opposite effect to that 
found in classical conditioning, i.e., 


impairment of performance. , This fol-. ° 


-lows from the assumption that drive 
(D). will multiply the habit strengths 
of all responses, correct or incorrect, 
thus increasing the amounts by which 
the excitatory strengths of the incor- 
rect-response or responses exceed that 
of the correct response. The effect of 
this, it will be apparent, would be to 
lower the probability of occurrence of 
the correct response, thus increasing’ 
the number of errors. A larger num- 
ber of trials would also be required 
before the stronger, incorrect responses 
would be weakened below the strength 
of the correct response. 

Still another factor that needs to be 
considered is that raising the drive 
level of S would also have the effect of 
increasing the number of fesponses 
with superthreshold excitatory. tend- 
encies, thus increasing the number 
‘of possible competing, erroneous re- 
. sponses. While these mew super- 
threshold responses would probably be 
weaker in habit strength than the 


correct response, the operation of oscil- - 


lation (sO) would lead to their being 
stronger on some occasions. Thus, 
even in instances in which the habit 
strength of the correct response is 
stronger than any of the altérnative 
superthreshold responses, the effect of 


increasing the drive, and-hence the . 


number of superthreshold responses, 
would be to increase the number of 
errors. ;, 

A study by Montague (3) compar- 
ing the rote serial learning of anxious 
and nonanxious Ss provides evidence 
bearing on these hypotheses. - Mon- 
tague constructed three serial ‘lists of 
nonsense syllables in which the intra- 
list similarity and association values of 
the syllables were manipulated in such 
a manner as to vary the likelihood and 
strength of competing responses in the 
different lists. 
the number and strength of the com- 
peting responses should have been rela- 


tively high in comparison with the 


correct response, the performance of 
the anxious group was significantly 
inferior to that of the nonanxious 
group. The relative performance of 
the groups was reversed, however, on 
the list in which an attempt was made 
to eliminate or minimize the strength 


and number of competing associations. 


These results, it will be seen, are in 
line with the implications of the theo- 
retical model outlined above. On the 
third list, which fell midway between 


. the other two in difficulty, the two 


groupsof Ss did not differ significantly. 

The present invéstigation represents 
a further study of the effect on perform- 
ance of differences in drive strength, 
defined in terms of level of scores on a 
test of manifest anxiety, in a learning 
situation that involves the presence 
of strong competing responses. The 
learning problem was a serial-learning 
situation involving the presentation of 
a sequence of choices between two 


responses, left and right, such as in the © 


typical multiple T-maze situation. 


. Instead of using one of the standard 


types of maze structures, however, a 
memory drum was employed. The S 
was required to anticipate whether the 
correct response was to the left or to 
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the right. The appropriate response 
was indicated either by the word 
“right” appearing in the right window 
of the -drum or the word “‘left’’ in the 
left window. The use of the memory 
drum with its 2-sec. rate of presenta- 
tion of the alternatives provided. for 
the control of the response time by 
forcing S to respond within a certain in- 


terval and thus not allowing for hesita- 


tien and possible verbal rehearsal. 


PROCEDURE 


Subjects. —The Ss were selected from an intro- 
ductory psychology course on the basis of extreme 
scores obtained on a modified form of the scale 
of manifest anxiety developed by Taylor (6). 
All were naive with respect to the experimental 
task and were unaware of the reason for their 
selection as Ss. a 

The anxious group consisted of 20 Ss whiose 
scores on the scale fell in the upper 15% of the 
class distribution while the nonanxious group was 
composed of 20 Ss falling in the lower 15°). The 
raw scores for the nonanxious group ranged from 
3-11 and for the anxious Ss from 25-37, 


Apparatus and procedure.—A Hull-type mem- _ 


ory dram, mounted on a table behind a fiber- 
board screen, was used for presentation of the 
material. The learning task consisted of a series 
of 20 choices, typed on a continuous white tape, 
each choice consisting of cither the word “right” 
or the word “left.” The word “right,” when it 
appeared, was typed on the right side of the tape, 
while “left” appeared always on the left. The 


order of the choices was as follows: LLRLRR-. 


LLRLRRRLRLLRRL. 

The choices were presented serially with a 
2-sec. exposure time and 2-sec. interval between 
exposures.. The intertrial.interval was 6 sec. 

The instructions given to each S were of the 
usual type for serial learning. The Ss were 
required to give a response to each exposure and 
did so with rare exception. Trials were con- 
tinued until S reached a criterion of two succes 
sive efrorless trials. 


REsuLtTs 


The data on the learning in terms of 
errors and trials to the criterion of 
mastery are presented in Table 1. It 


will be observed with respect to both’ 


measures that'the nonanxious Ss were 
superior to the anxious. Since a chi- 


TABLE 1 


Means For Error ann Triat Measures 

















Errors | Trials 
Group >~—— —_— 
Mean omM Mean oM 
Anxious * 157.55 | 17.78 32.75 2.05 


12.94 | 25.12 2.38 





Nonanxious| 118.70 





- square test indicated that the distri- 


butions of both the error and trial 
measures departed significantly from 
normality, a nonparametric test, de- 
vised by Mann and Whitney (2), was 
employed to test the hypothesis that 
there was no difference with respect 
to the distribution of scores for the 
anxious and nonanxious Ss. On the 
basis of this hypothesis the probability 
that the obtained différences_ between 
the two groups of Ss would occur was 
found to be between .O1 and .05 in the 
case of the error measure and less than 
.O1 in the case of the trial measure. 
This finding, it will be seen, is in 
line with the theoretical expectation 
that the performance of the anxious 
(high drive) Ss would be inferior to 
that of the nonanxious (low drive) Ss 
in this competitive learning situation. 
This result is the opposite to that found 
in classical conditioning in which, in 


‘accord with the theory, the perform- 


ance of the anxious Ss was superior to 
that of the nonanxious Ss. 
Theoretically, the difficulty of learn- 
ing the individual choice points is, in 
part, a function of the strength of the 
competing anticipatory and persever- 
ative response tendencies. On this 
basis we should expect the number of 
errors made by anxious Ss to be much 
greater than that made by nonanxious 
Ss on the most difficult choice points, 
whereas this difference would be less 
on the points of choice intermediate in 
difficulty and still less and possibly 
even in the opposite direction in the 
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case of the easiest choice points. When 
the choice points: were arranged in 
order of difficulty, it was observed that 
there was a definite tendency for the 
differences to be relatively greater for 
the difficult choice points than in the 
case of the easy choice points. _A rank 
order correlation between the order of 
difficulty of the choice point and the 
corresponding magnitude of the differ- 
ence between the two groups of Ss in 
terms of mean,errors, was found to be 
.83. This value is significantly dif- 
ferent from zero well beyond the .01 
level of ‘confidence. It was noted, 
moreover, that at only a single choice 
point, the first, did the anxious Ss 
makefewer errors than the nonanxious 
Ss. In view of the ease of learning 
some of the points of choice, the theory 
would have led us to expect a greater 
number of such instances. In this 
respect the experimental findings are 
not entirely in agreement with the 
theory. Whether the theory is incor- 
rect or merely incomplete in the vari- 
ables it specifies is not clear. Further 
experimentation needs to be carried 
out in learning situations in which the 
number and relative strengths of the 
responses competing with the correct 
one can be controlled and manipulated. 


SUMMARY 


The present experiment was con- 
cerned with the relation of drive 
strength (D) to performance in a learn- 
ing situation. The specific theoretical 
implication investigated was that the 
performance of anxious (high drive) 
Ss would be inferior to: that of non- 
anxious (low drive) Ss in a learning 
situation that involved the presence 
of competing responses, i.e., a trial- 
and-error type of learning situation. 

Two groups of 20° Ss each were 
chosen on the basis of extreme scores 





made on a test of manifest anxiety, 
those with high scores being the anx- 
ious (high drive) group, and those with 
low scores being the nonanxious (low 
drive) group. Both groups were run 
in a serial learning situation involving 
the presentation of a series of 20 
choices between two verbal responses, 
saying “left” or “right”? at each point 
of choice in a memory drum setup. 
The criterion of learning was two suc- 
cessive trials in which S successfully 
ahticipated the correct response at 
each choice point. 

The results were in agreement with 
the theoretical expectation that the 
anxious (high drive) Ss would make a 
significantly greater number of errors 
and require a larger number of trials to 
reach the learning criterion. 

It was also found that the points of 
choice that were most difficult and 
hence presumably involved the most 
competition from erroneous response 
tendencies provided the greatest dif- 
ferences between the two groups of Ss. 


(Manuscript received December 26, 
1951) 
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VISUAL-RECOGNITION THRESHOLDS AS A FUNCTION OF 
WORD LENGTH AND WORD FREQUENCY 


ELLIOTT McGINNIES, PATRICK B. COMER, AND OLIVER L. LACEY 
University of Alabama 


Recent experiments dealing with 
the relationship between visual-rec- 
ognition thresholds and the mean- 
ingfulness of verbal stimuli have em- 
phasized the necessity of controlling 
such intraverbal parameters as word 
frequency. Using the Thorndike- 
Lorge counts as an index of relative 
frequency of occurrence, Howes and 
Solomon (1) have demonstrated that 
visual-recognition thresholds for tachis- 
toscopically presented words approx- 
imate a linear, decreasing function of 
the logarithm of the frequency of 
usage. These writers indicate prefer- 


ence for a response probability variable 
to interferences involving perceptual 


mechanisms in experiments where rec- 
ognition thresholds appear meaning- 
fully related to the attitudinal signifi- 
cance of the verbal material employed. 

Since the Thorndike-Lorge word 
counts are derived from a population 
sample and do not take into account 
individual frequencies of word usage, 
Solomon (5) has attempted experi- 
mentally to control relative word fre- 
quencies for a group of Ss before 
determining visual-duration thresholds 
for the words. The Ss leafed through 
cards, each presenting a nonsense 
word, and pronounced each word as it 
appeared. There were 24 nonsense 
words which varied in frequency of 
appearance from 1 to 25 times. Later, 
ten of these words were exposed tachis- 
toscopically along with familiar Eng- 
lish words and unfamiliar nonsense 
words, and their  visual-duration 
thresholds were determined. Results 
of this investigation indicated that 
visual-duration threshold for words 
approximates a negatively accelerated 


decay function of frequency of word 
usage.! 

It seems convincingly established 
that frequency of word usage is related 
to ease of recognition under conditions 
of tachistoscopic exposure. Interest- 
ingly enough, Howes and Solomon (1) 
in first reporting this relationship dis- 
miss the factor of word length as hav- 
ing no significant bearing on their 
results. Although the words em- 
ployed in one of their experiments 
varied in length from 6 to 12 letters, 
they made no attempt to employ this 
variable as an experimental parameter 
or to partial out its possible effect 
upon threshold. In fact, the inves- 
tigators state, ““The length of a word 
did not appear to affect our thresh- 
olds” (1, p. 409). In Solomon’s (5) 
later experiment, the possible differ- 
ential effect of word length was elim- 
inated by employing only 7-letter 
words. 

This assumption regarding the ab- 
sence of a relationship between recog- 
nition thresholds and word length is 
rather surprising in view of the care 
exercised by Howes and Solomon in 
ascertaining the effects of such param- 
eters as type and patterning of letters 
and syllable length on thresholds. 
Moreover, there is evidence from 
research by Zipf (7) that longer words 
occur less frequently in the language 
system. Although length and fre- 
quency would thus influence recogni- 
tion thresholds in a similar fashion, 
the relative contribution of each has 
not been investigated. 

The present experiment was planned 


! This manuscript was very kindly made avail- 


able by Dr. R. L. Solomon. 
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to isolate the effects of both word 
length and word frequency upon 
thresholds of recognition, as measured 
by duration of exposure necessary for 
veridical report. 


PROCEDURE 


The experiment was designed so that the 
results for each S could be summarized in a mul- 
tiple-regression equation relating the three vari- 
ables: word length, word frequency, and thresh- 
olds of recognition. In order to sample an ade- 
quate range of word lengths, words of 5, 7, 9, 
and 11 letters were selected. Each category of 
length contained five words selected so that their 
frequencies of occurrence, as listed in the Thorn- 
dike-Lorge tables (6), fell in the vicinity of 10, 
100, 200, 300, and 400 per million. In deter- 
mining word frequency, we followed the proce- 
dure suggested by Howes and Solomon (1) of 
taking the average of the figures for each word 
representing the Lorge magazine count and the 
Lorge-Thorndike semantic count. Words which 
satisfied the requirements of length and frequency 
were selected by use of Snedecor’s table of ran- 
dom numbers (4). Twenty page numbers rang- 
ing from 1-208 (the number of pages in the 
Thorndike-Lorge lists) were obtained from this 
table. The first word on any such predetermined 
page which met the requirements of length and 
frequency was recorded, and reference was then 
made to the next randomly selected page. If no 
word of the appropriate dimensions was found 
on the designated page, successive pages were 
examined until such a word appeared. This pro- 
cedure was adopted in order to avoid selecting 
words of any particular length or frequency hav- 
ing structural characteristics that would syste- 
matically influence recognition under conditions 
of threshold exposure. 

The 20 words secured in this manner were 
further randomized for position in the serial order 
in which they were presented to the Ss, and four 
“trial” words having different length and fre- 
quency characteristics were placed at the begin- 
ning of the list in order to reduce initial practice 
effect. The resulting set of 24 words was typed 
in capital letters of uniform blackness on a roll 
of cash register paper. This roll was mounted 
in a Harvard tachistoscope so that the words 
could be advanced singly into the viewing rec- 
tangle. The stimulus words are listed in Table 1. 
Brightness of the stimulus field as measured by a 
Weston Master II Exposure Meter was .20 ft.- 
candles. The pre-exposure field reflected the 
same illumination as the stimulus field, and Ss 
remained light adapted throughout the experi- 
mental session. 


TABLE 1 


EXPERIMENTAL Worps LisTep IN ORDER OF 
PRESENTATION TO Ss* 














Frequency vipapeney 
Words (Per Words (Per 
Million) Million) 

Fiere** 231 |SHaPE 405 
Controversy** 83 |Decorum 1l 
Dis.iixe** 123 |AcENT 319 
FortnicutT** 28 |ForcoTTen 305 
SUBSTANTIAL 98 |BeTuoucnatT 6 
STAIN 90 |PossiBILITy 318 
ASSOCIATION 412 |GirtH 
HovusEHOLD 222 +|Pus.LisHeR 95 
WEATHER 391 |MacuINnERY 393 
IMAGINATION 209 +|ELEMENT 305 
TIGHTLY 106 |OrcaN 196 
CouNTERPANE 7 |Insprre 196 





* Half of the Ss received the words in the sotes chun: 
the other half received the four Trial Words and then 
the experimental words in the reverse order beginning 
with “Inspire.” 

* Trial words. 


The experiment was performed in a room 
equipped with sound-absorbing material on the 
walls and ceiling. Twenty Ss, 12 men and 8 
women, were drawn from students at the Uni- 
versity of Alabama. All were naive with respect 
to the problem under investigation. When seated 
before the viewing window of the tachistoscope, 
S was instructed as follows: 


Seat yourself comfortably so that you can 
look through this window. I am going to 
show you some words at rather short expo- 
sures, and at first you may not be able to 
recognize them. As soon as you think you 
see a word, however, make a guess. The 
word will appear where you see the small 
rectangle. In order to avoid eye strain, 
don’t stare at the rectangle all the time. I 
will give you a “ready” signal just before 
exposing the word. Be sure to guess the 
moment you think you recognize a word. 


Wide differences in visual acuity were avoided 
by dismissing Ss whose average duration thresh- 
olds for the four trial words exceeded .20 sec. 
Only three Ss were so disqualified. ‘Thresholds 
of recognition for the stimulus words were ob- 
tained by exposing each word first for a duration 
of .01 sec. and then for durations increased suc- 
cessively by .01 sec. increments until S reported 
the word correctly. Fatigue was reduced by 
allowing each S to rest for a few minutes after 
thresholds for the first 12 words had been deter- 
mined. Any differential effects of practice or 
serial effects on the thresholds of words appearing 
late in the list were balanced out by reversing 
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the order of presentation of the 20 experimental 
words for ten Ss. 

The perceptual “guesses” or hypotheses vol- 
unteered by Ss following prerecognition exposures 
of the words were recorded for later analysis. 


REsuLTs AND Discussion 


Threshold relationships—The first 
step in analyzing the data consisted of 
computing for each S an equation 
defining the regression of threshold 
upon both length and frequency.? 
From these, an equation represent- 
ative of the group was obtained by 
averaging the intercept and slope sta- 
tistics for the individuals. Thus: 





Se Sig -Bes & 


° 
n 


THRESHOLD OF RECOGNITION (sce) 





° 








40 100 200 300 400 
WORD FREQUENCY 


Fic. 1. Regression lines for the 20 Ss relating 
recognition thresholds to Thorndike-Lorge fre- 
quencies of the stimulus words 


Y = 5.48 — .012X, + .43X2. Here, 
Y is the predicted threshold in hun- 
dredths of a second; the constant is 
the intercept value; X, denotes fre- 
quency of occurrence; and X,2 desig- 
nates word length in letters. In each 
case, the 20 partial regression coeffi- 
cients yielding the means were of like 
signs. The standard deviations of the 
means were as follows: mean intercept, 
615; mean regression coefficient for 


2 A correlation of .19 was found to exist be- 
tween the length of the experimental words and 
their relative frequencies. This value was used 
as a correction constant in computing the regres- 
sion equations. 





THRESHOLD OF RECOGNITION (ec) 
& 8 











WORD LENGTH 


Fic. 2. Regression lines for the 20 Ss relating 
recognition thresholds to length of stimulus 
words 


frequency, .0012; mean _ regression 
coefficient for length, .068. All of 
the means differed significantly from 
zero at better than the 1% level of 
confidence. 

In order to demonstrate individual 
differences in trend as well as absolute 
levels of thresholds, regression lines 
have been plotted for the 20 Ss show- 
ing the predicted recognition thresh- 
olds for words of different frequencies 
and different lengths. These are found 
in Fig. l and 2. It should, of course, 
be emphasized that the obtained 
thresholds hold only under the speci- 
fied experimental conditions. 

The general results of the experi- 
ment are summarized in Table 2, 
showing the mean recognition thresh- 


TABLE 2 


Mean Recocnition THRESHOLDs 1n Hun- 
DREDTHS OF A SECOND For ALL Ss 














Length in Letters 
Frequency 
(Per Million) 
5 7 9 il 
10 063 .103 096 126 
100 049 055 056 066 
200 067 .074 066 059 
300 047 059 .062 056 
400 041 051 .058 047 
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TABLE 3 
ANALYsIS OF VARIANCE OF THE Raw Data 
Source af — F 

Individuals 19 94.6 15.8* 
Within Group 361 5.98 

Frequency + 325.2 54.4* 
Length 3 70.7 11.8* 
Frequency X length 12 35.8 5.99* 














* Significant at the 1% level. 


olds for all Ss for the particular cate- 
goriesemployed. Inspection suggests 
at once a rise in thresholds with in- 
creasing length and a fall with increas- 
ing frequency. Analysis of variance 
of the raw data yielded results as 
shown in Table 3. As indicated, all 
effects were significant at far better 
than the 1% level. The significant 
interaction between frequency and 
word length may be described as fol- 
lows: (a) an increase in frequency low- 
ers the thresholds of long words more 
than those of short words; (b) longer 
words are accompanied by higher rec- 
ognition thresholds to a greater extent 
in the case of low-frequency words 
than in the case of high-frequency 
words. In short, one cannot state the 
precise effect of either word frequency 
or word length upon thresholds of 
visual recognition without considering 
the interaction between the two. 
The experimental results thus con- 
firm the general findings of Solomon 
(5) and Howes and Solomon (1) with 
respect to the dependence of recog- 
nition thresholds upon frequency of 
occurrence of the stimulus words in 
the language system. Our results, 
however, not only contradict the con- 
tention of Howes and Solomon that 
recognition thresholds are not affected 
by word length, they also indicate that 
length and frequency interact to a 
significant extent. The frequency 
effect is most apparent with long 


words, while the length effect is most 
striking with words of low frequency. 


Hypothesis analysis—As noted earlier, we 
kept a record of the “hypotheses” volunteered 
by Ss following prerecognition exposures of the 
stimulus words. Other investigators (2, 3) have 
observed that the hypotheses formed by Ss 
attempting to recognize words at subthreshold- 
exposure durations are meaningfully related to 
the stimulus words. That the associational tone 
of the stimulus word, particularly if it has taboo 
connotations, influences the nature of the related 
hypothesis seems established. Regardless of the 
meaning of the stimulus word, however, per- 
ceptual hypotheses tend to be organized around 
the structural characteristics of the stimulus. 
We might ask a further question: Since the fre- 
quency of occurrence of a word is a partial 
determinant of its duration threshold, will its 
relative frequency be reflected in the frequencies 
of the hypotheses that it elicits? 

In order to answer this question, we looked 
up each of the recorded hypotheses in the Thorn- 
dike-Lorge tables and noted the mean frequency 
of the “L” and “S” entries. We then computed 
a product-moment correlation for each S based 
upon the previously determined Thorndike-Lorge 
frequencies of the experimental words and the 
mean Thorndike-Lorge frequencies of S’s hypoth- 
eses to each of these words. If S volunteered no 
hypotheses to any stimulus word, that word was 
disregarded in computing the correlation. The 
20 correlations so obtained were converted to z 
scores, and the significance of the mean z score 
was evaluated by means of thet test. The mean 
of the 20 z scores was —.121. This does not 
deviate significantly from zero. We may there- 
fore conclude that there is no systematic rela- 
tionship between the semantic frequency of stim- 
ulus words and the frequencies of the hypotheses 
elicited by these words. 

This finding is probably explained by the fact 
that there was a significant tendency on the part 
of Ss to formulate hypotheses which in general 
had higher Thorndike-Lorge frequency listings 
than those of the related stimulus words. The 
average frequency of occurrence of all the experi- 
mental words to which hypotheses were volun- 
teered was 169 per million. The average fre- 
quency of the hypotheses made in response to 
subthreshold exposures of these words was 438 
per million. The difference is significant at the 
1% level of confidence. Regardless of the fre- 
quency of occurrence of a stimulus word, Ss 
apparently formulate hypotheses that are rela- 
tively common in everyday usage. In addition, 
Ss in the present study followed the usual tend- 
ency of basing their hypotheses primarily upon 
the structure of the stimulus words. This was 
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true whether the stimulus word was of low or 
high frequency. 


SUMMARY 


Thresholds of recognition for words 
varying in length and frequency were 
determined for 20 Ss. Despite indi- 
vidual differences in slope, and level 
of perceptual acuity, regression equa- 
tions determined for each S indicated 
that duration thresholds for neutrally- 
toned words are a linear, decreasing 
function of word frequency and a 
linear, increasing function of word 
length. These findings hold true for 
words varying from 5 to 11 letters in 
length and from 10 to 400 occurrences 
per million in frequency. Analysis of 
variance of the data also revealed a 
significant interaction effect between 
frequency and length. This relation- 
ship seems to be one in which an in- 
crease in frequency lowers recognition 
thresholds more for long words than 
for short words. An increase in word 
length, on the other hand, raises 
thresholds more strikingly for low-fre- 
quency words than for high-frequency 
words. 

No significant relationship was ob- 
served between the Thorndike-Lorge 
frequencies of the stimulus words to 


which the Ss volunteered hypotheses 
and the frequencies of the hypotheses. 
The mean Thorndike-Lorge frequency 
of the hypotheses formulated by Ss 
prior to correct recognition was signifi- 
cantly larger, however, than the mean 
frequency of the stimulus words elic- 
iting these hypotheses. 


(Manuscript received October 9, 1951) 
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In recent years it has been recog- 
nized that the quality and effective- 
ness of an individual’s thinking is 
related to the language or symbol 
system he uses as well as to his indi- 
vidual capacities and skills. In prob- 
lem solving the particular symbols used 
will limit or give direction to the types 
of solutions that will be employed (7). 
Similarly, in the conceptual organiza- 
tion of a number of symbols repre- 
senting common objects, the type of 
symbols used may influence the number 
and quality of the concepts attained. 
The psychologist has reason to find 
measures of how effectively persons 
can use various symbols to organize 
the objects of their experience. If all 
symbols are classified in a systematic 
way, it then becomes possible to estab- 
lish whether there is a relationship 
between variations in symbols and dif- 
ferences in the concepts that individ- 
uals attain. One systematic way for 
classifying all symbols is that based 
upon their level of abstraction. 

In an unpublished exploratory ex- 
periment by the writer two levels of 
abstraction were represented by a 
verbal and a pictorial representation 
(symbol) for each of a group of fa- 
miliar objects. The names, such as 


1 Based upon a dissertation submitted in par- 
tial fulfillment of the requirements of the degree 
of Doctor of Philosophy, in the Department of 
Psychology of the University of Pennsylvania. 
The writer is indebted to Dr. Francis W. Irwin, 
under whose friendly supervision the research 
was conducted, and to Dr. Samuel W. Fern- 
berger, Dr. James C. Diggory, and Dr. Mortimer 
Garrison, Jr., who served on the advisory com- 
mittee. Miriam B. Davidon cheerfully con- 
tributed many hours to the scoring of results. 


“coal shovel” and “leather glove,” 
were considered to be the more ab- 
stract, to represent wider classes of 
referents, while the photographs of the 
corresponding objects were accepted 
as less abstract because they offered 
many specific, concrete details. Sig- 
nificantly more concepts were attained 
with the pictorial symbols than with 
the verbal ones, and these results have 
led to the present development and 
testing of a general theory of concept 
attainment. 

Although experimentalists have 
studied behavior variously referred to 
as concept formation, concept attain- 
ment, generalizing abstraction, and 
abstract and concrete behavior, there 
has been relatively little experimental 
study of the day-to-day activity where- 
by persons use symbols to organize 
and differentiate common events and 
objects. A general theory is required, 
a theory which is not confined to the 
acquisition of ordinary concepts during 
childhood or to the formation of new 
nonsense concepts in the laboratory. 

The term concept formation implies 
that concepts are formed once by a 
special learning process, after which 
they are merely remembered and used 
overandover. This is surely an over- 
simplification, for although one may 
learn to relate a word (symbol) to a 
standard definition, each time he effec- 
tively uses the word the psychological 
event and the meaning will be some- 
what different. More than memory is 
involved, for an organization of the 
whole cognitive field is necessary. The 
effectiveness of the individual’s beha- 
vior is dependent upon the extent of 
his discriminatory organization, not 
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merely upon which concept he is able 
to recall. 

If one considers the perceptual 
organization in first learning to use a 
symbol as others use it to be similar to 
the organizational process through 
which one later applies the same sym- 
bol to a specific situation, the distinc- 
tion between concept formation and 
the use of concepts becomes of less 
importance. With the assumption 
that these psychological processes are 
essentially similar, one may relate the 
present experiment involving the 
organization of familiar objects to pre- 
vious experiments involving the for- 
mation of new concepts. 

Most of the research in concept 
attainment has been accomplished by 
two groups of psychologists whose 
methods and aims have been quite dif- 
ferent. The first group has used mem- 
orization and serial learning techniques 
to study different stimulus variables, 
and has expressed results largely in 
terms of the number of learning trials. 
The experiments of Hull (8) and of 
Heidbreder (4, 5, 6) are representative 
of this group. The second group, in 
which Goldstein (2) and Hanfmann 
(3) are included, has stressed qualita- 
tive differences in response variables, 
such as “concrete and abstract atti- 
tudes” or the “ability to shift,” and 
has used sorting techniques to appraise 
the individual’s behavior. 

The experiment to be described re- 
flects the influence of both points of 
view. The quantitative effects of 
stimulus variables are emphasized, 
although serial learning techniques are 
rejected. Concept attainment is not 
conceived as a simple associative learn- 
ing process or a form of discriminative 
conditioning which can be measured 
by the number of exposures or the time 
elapsed. Rather, a situation has been 
defined in which S can exercise abstract 
organizational ability, in which con- 


cepts are not rigidly limited by the 
nature of the stimuli. 


HypoTHEsEs 


I. On the average a normal adult 
will attain fewer different concepts 
when the symbols representing com- 
mon objects are at a higher level of 
abstraction than he will when the sym- 
bols are at a lower level of abstraction. 

II. (a) Fewer concepts will be at- 
tained if an individual has to make an 
initial shift from one conceptual organ- 
ization of a group of symbols to 
another organization than if he begins 
the task without the first “irrelevant” 
organization. (6) If the previous hy- 
pothesis is verified, then the higher the 
level of abstraction of the symbols, the 
easier it will be to shift from the initial 
organization to the subsequent one. 

III. (a) If one can physically sep- 
arate groups of symbols and can 
change the stimulus field through 
repeated voluntary manipulations, his 
sorting behavior will be improved. 
(6) And if manipulation increases the 
number of concepts attained, then the 
lower the level of abstraction of the 
symbols, the greater will be the ad- 
vantage of manipulation. 


Tue INDEPENDENT VARIABLES 


The level of abstraction of the symbols.—In addi- 
tion to the direct perceptual organization obtained 
in the discrimination of a given picture or written 
name there is the recognition in certain situations 
that both the picture and the name of an object 
aresymbols. The symbols represent a class each 
member of which has certain similarities—and 
has differences which are ignored. In their func- 
tion as symbols pictures and words need not be 
considered as different kinds of stimuli, but rather 
as symbols representing different levels of ab- 
straction on a single continuum. The level of 
abstraction is determined by the number of 
defining common characteristics related to the 
symbol. As Welch has indicated, the more inclu- 
sive the class the fewer are the characteristics 
common to its species. “The more inclusive the 
class the more abstract the concept” (10, p. 202). 
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Conversely, the more detailed and more numer- 
ous are the defining characteristics the smaller is 
the class of referents to which the symbol per- 
tains, and the “lower” is its level of abstraction. 
Theoretically, a detailed picture accepted as a 
symbol relates to a more limited class than does 
a name or short title for a corresponding object. 
To delimit verbally the same class symbolized by 
a picture might require many pages of descrip- 
tion—and perhaps it could never be done exactly. 
Different persons, however, may interpret the 
same verbal and pictorial symbols at many differ- 
ent levels of abstraction. In this experiment it 
was assumed that for most Ss the interpreting 
of the generality of the words and of the pictures 
followed the theoretical relationship described 
above. 

Four levels of abstraction were used. Short 
titles were considered to be more inclusive than 
long titles, long titles more inclusive than 
drawings, and drawings more inclusive than 
photographs. 

Shift-—Hanfmann and Kasanin (3) have pre- 
sented evidence that there are differences in the 
ability of normal Ss to voluntarily restructure 
their perceptual field. But, success in shifting 
may not be only a function of individual abilities 
but may be in part determined by the level of 
abstraction and the configuration of the symbols 
to be reorganized. 

In several experiments performed previously 
by others (1, 3) it has been determined whether 
or not S is able to shift voluntarily, to reorganize 
the stimuli when asked to do so by Z. The only 
quantitative measure usually taken is the num- 
ber of shifts that can be made with the same 
stimuli. In the present experiment the effect of 
an initial shift was indirectly measured by com- 
paring the performance of those who were pre- 
sented with a field which was not conceptually 
organized, with the performance of those who 
had had to restructure a completely organized 
stimulus field. 

Manipulation.—It was anticipated that the 
number of concepts that would be attained was 
in part a function of the spatial relationships 
between the objects. This followed from the 
assumption that the nearer two stimuli were in 
space (everything else being equal) the more 
likely it would be that they would be classified 
together. And, it was assumed that the fewer 
stimuli there were to be considered the more 
readily could one perceive existing relationships 
between them. By allowing some Ss to manip- 
ulate the symbols, to move the cards about, 
smaller distances between them would be fre- 
quently attained, and by distinctly separating 
completed groups from the remaining field the 
further organization of the remaining symbols 
would be made easier as their number decreased. 
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This variable might also be described in terms 
of difference in isolation. For, when the posi- 
tions’ of all symbols were fixed and there were 
irrelevant symbols between related ones, it was 
difficult to isolate the irrelevant symbols, to be 
constantly aware of their essential differences. 

Should one consider isolation and manipula- 
tion to be separate variables, then their effects in 
this experiment would be confounded. But how 
isolation might be measured independently of 
grouping is not readily apparent. King’s factor 
analysis (9) indicated that the tests in his battery 
were not effective for making such a distinction. 


MeTHOD 


The same familiar objects were presented to 
every S by means of words or pictures, and the 
spatial arrangement of these was exactly the 
same. The S’s task was to find and label as 
many mutually exclusive three-card groups as 
he could within the timed period. The score 
was the number of concepts attained which met 
the criteria. 

Half of the Ss saw cards with names and half 
saw cards with pictures of corresponding common 
objects. And of those who grouped names, half 
grouped “long names” and half grouped a corre- 
sponding set of “short names.” Also, of those 
who grouped pictures, half worked with photo- 
graphs and half used simple line drawings. 

Half of all Ss were told initially that each of 
the six columns of cards represented a class of 
objects. These five-object classes were named 

y S when possible; if not, they were explained 
by £. The other Ss were not informed of this 
“irrelevant” grouping, but they were asked at 
the end of the experiment whether they had 
observed such relationships. 

Half of all Ss were encouraged to move the 
cards about on the board and to put all cards of a 
group together in a specially provided frame 
before they recorded their title for the group. 
The other Ss were not permitted to move the 
cards, but were instructed to write the title of a 
group and the numbers of the respective cards on 
the record sheet. 

There were 16 conditions with eight Ss treated 
alike under each condition. The Ss were assigned 
to conditions at random. 


Materials 


Stimulus objects.—Thirty familiar objects were 
selected so that it was possible to classify them 
into ten mutually exclusive groups of three ob- 
jects each. There were four different packs of 
cards which represented these “same” objects. 
The objects were: lantern, clock, canoe paddle, 
egg beater, umbrella, stool, pail, eyeglasses, bi- 
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cycle, refrigerator, brassiere, pencil sharpener, 
pen, searchlight, basket, scale, curtains, safe, 
washtub, speedometer, saddle, cupboard, awn- 
ing, typewriter, cigarette lighter, camera, fishing 
reel, knife, ear muffs, fan. 

In the first pack there was a printed photo- 
graph representing each object, mounted on a 
white 3 X 5-in. card. The photographs were cut 
from mail-order catalogs. In the second pack 
there was a drawing of each object made in black 
ink with an A-4 Speedball pen on a white 3 X 5- 
in. card. The outline of the drawing was iden- 
tical to that of the photograph, and a minimum 
of detail was included. In the third pack a “long 
name” (two or three words) for each object was 
printed on a white 3 X 5-in. card. And in the 
fourth pack a “short name” (one or two words) 
was printed in the same way. In every case the 
“long name” included the “short name” plus one 
or two adjectives, e.g., alarm clock. The “short 
names” are given in the preceding paragraph. 
The list of “long names” is: kerosene lantern, alarm 
clock, wooden canoe paddle, hand egg beater, 
lady’s umbrella, high stool, steel pail, silver frame 
eyeglasses, man’s bicycle, modern refrigerator, 
“uplift” brassiere, wall pencil sharpener, old 
fountain pen, motorboat searchlight, basketball 
basket, household scale, window curtains, floor 
safe, round galvanized washtub, auto speedom- 
eter, western saddle, china cupboard, striped 
awning, portable typewriter, table cigarette 
lighter, “candid” camera, casting fishing reel, 
butcher knife, woolly ear muffs, and electric fan. 

There was also a practice set of six objects, 
made up similarly with four packs. 

The four packs of cards were prepared to rep- 
resent four levels of abstraction. For example: 
one of the photographs represented a particular 
brand and style of a round-faced spring-wound 
alarm clock. The ink drawing of the clock with 
many details of the face omitted was still readily 
recognizable as a round-faced alarm clock, but it 
would have been difficult to correctly identify 
the brand, for several manufacturers have made 
clocks which are similar enough so that the out- 
line drawing might have represented any one of 
a number of models. The drawing, then, was 
considered to be an adequate symbol for all of 
the clocks of the particular model photographed, 
plus additional similar clocks of other manufac- 
turers. The long name, “alarm clock,” was a 
symbol which represented all of the clocks similar 
to the drawing, plus square-faced alarm clocks, 
pocket alarm clocks, etc. And the short name, 
“clock,” represented all alarm clocks, plus grand- 
father clocks, office clocks, auto clocks, etc. 
Since each of these symbols—the photograph, 
the drawing, the “long name,” and the “short 
name”’—included a wider class of objects than 


the preceding one, each was considered to be at a 
higher level of abstraction. 

Sorting board.—In order to keep the distances 
between the cards constant for all Ss throughout 
the experiment a slotted sorting board was used. 
The plywood board 37 in. by 24 in. over-all had 
30 shallow slots, one for each card. Each slot 
was 5 in. long, 2} in. wide, and } in. deep. There 
were six slots spaced } in. apart in each row, and 
there were five rows spaced 2 in. apart. The 
cards rested in the slots at a slight angle, which 
increased their visibility to S. 

Frames.—Ten biack wooden frames were 
made, each constructed of }-in. by 4-in. strips. 
The outside dimensions of each rectangular frame 
were 4 in. by 174 in. The frame fitted easily 
over any three adjacent slots (cards) in a row. 


Procedure 


While S was being given instructions, the sort- 
ing board was covered so that he could not see 
the 30 cards which were already distributed in 
the appropriate spaces on the board. Each S 
used a mimeographed record sheet and a dupli- 
cate was maintained by £. If E£ suspected that 
S had made an error in recording a card number, 
or if E could not read S’s title, he verified the 
entry as soon as it was made. For the last 64 
Ss, E recorded the time to the nearest minute 
when each concept was recorded. 

Prior to the experiment each S was asked 
whether anyone had spoken to him about the 
details of the experiment. All Ss stated that no 
previous information had been received. All Ss 
were requested not to communicate with others 
regarding the experiment. 

The E then gave the following instructions: 
“On this board before you are 30 cards with pic- 
tures (names) representing common objects. If 
you do not know what any one of the objects is, 
ask me about it when you first see the cards. I 
would like you to see how many separate groups 
of three objects you can find. When you have 
found three objects that belong together, write 
the numbers of the cards on this record sheet, 
here, and write your name or title of the group 
over here. Now this is most important! The 
title should not apply to any other object on the 
board other than these three. That is, no more 
than three objects can be described by this title. 
The title may have more than one word. 

“Briefly, my purpose is to learn how many 
separate groups you can discover. You will have 
plenty of time—25 min. I will tell you when 
you have but 5 min. to go. 

“But before you begin, let me show you a 
sample of six cards. Do you see a group of three 
here? I can group (E manipulates cards and 
uses frames for those Ss who will do the same 
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during the experiment) the boy, the girl, and the 
chicken, No. 5, No. 3, and No. 4 together as 
‘creatures that walk on two legs,’ or simply 
‘bipeds.’ And, I can group the horse, dog, and 
deer, No. 2, No. 1, and No. 6 together as ‘four- 
legged animals.’ I would not be able to pick 
just three of these cards, say No. 1, No. 2, and 
No. 3 and label them ‘living beings,’ because 
this title would apply to more than three of the 
cards. If I used the name ‘domestic animals’ 
I would have one correct group of three—the 
chicken, the dog, and the horse, No. 4, No. 1, and 
No. 2. But then I could not classify the boy, 
the girl, and the deer together. I would have 
but one group instead of two. And the aim, you 
recall, is to get as many groups as possible. Do 
you have any questions? 

“Let me review the main rules of the problem. 
I will put this large card with the rules there so 
you may refer to them later if you wish. (1) 
Find as many three-object groups as you can. 
(2) Write the card numbers and the title of each 
group. (3) The title of the group should not 
apply to any of the other objects on the board. 
(4) An object can appear in only one group.” 

Manipulation Ss—The special instructions 
were: “You may move the cards about the board 
from place to place, just as long as each card rests 
in one of the openings. As soon as you have 
found three cards that belong together put them 
in a row adjacent to one another, and put one of 
these black frames around them—before writing 
the numbers and group title.” 

Nonmanipulation Ss—The special instruc- 
tions were: “You are not to touch or move the 
cards during the experiment. As soon as you 
have found three objects which form a separate 
group write the card numbers and the group 
name on the record sheet. If you decide to 
change a grouping, line it out and use another 
row on the record sheet. Now here is the board 
with the test objects.” 

Shift Ss.—The special instructions were: “No- 
tice that the five objects in each column have 
some relationship. What are the things in this 
column?” If there was no reply in 10 sec. or an 
inadequate title was given: “These are containers 
for liquid.” Similarly, if S could not name one 
or more of the other columns: “These all have 
glass faces,” “These are all sports equipment,” 
“These are all kitchen items,” “These are made 
with cloth,” “These are all office equipment.” 
And then, “Now, your task will be to find 3-card 
groups so that the titles apply only to three 
cards, not to four, five, or more.” 

Nonshift Ss.—At the end of the period they 
were asked: “When you began this task did you 
notice any column of five cards which was a 
group in itself? What would you call it (them) ?” 
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Subjects 


The Ss were 128 volunteer students enrolled 
in elementary psychology at the University of 
Pennsylvania, 65 women and 63 men. One S 
failed to take the experiment seriously and was 
replaced. 


Scoring 


The measure of S’s performance was the num- 
ber of separate concepts attained. Although Ss 
searched for mutually exclusive concepts, an 
absolute criterion of mutual exclusiveness was 
considered to be impractical in scoring. The 
criteria of a separate concept used in scoring were 
as follows: (a) The title of the group of objects 
had to include all three members of the group. 
(6) Every other object on S’s record sheet was 
checked to see whether it also was included within 
this title. If two other objects were included, the 
concept was cancelled. If one other object was 
included, the concept was acceptable, but the out- 
side object was marked so that the following 
action might be taken when necessary. (c) 
Whenever a group had two or three objects 
which had been scored as being included within 
other titles, then this concept was cancelled. For 
example, when S using “long names” grouped 
and labeled the objects as below, 


portable typewriter 
floor safe 

high stool 
Hand-cranked objects wall pencil sharpener 
hand egg-beater 
casting fishing reel 
man’s bicycle 
basketball basket 
wooden canoe paddle 


Office equipment 


Sporting goods 


he was credited with two separate concepts 
rather than three. The concept “hand-cranked 
objects” was cancelled because a pencil sharp- 
ener is an item of office equipment and a fishing 
reel is a sporting goods item. (¢d) When a con- 
cept was cancelled it was eliminated entirely from 
the scoring of other concepts. 

Judgment was still required in the determi- 
nation of which objects were included under each 
of the titles used. To obtain a measure of the 
effects of variability in judgment upon the scores 
obtained, the two duplicate record sheets for 
each S were scored independently by two well- 
trained scorers. The correlation between the 
scores for the 128 Ss by the two scorers working 
independently was .93. There was no case in 
which the two scorers differed by more than one 
concept. The mean for one scorer was 7.29 and 
for the other it was 7.44, a nonsignificant differ- 
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ence. The interpretation of results would be the 
same for the data of either scorer. The results 
as scored by the author are given in this report. 


REsULTs AND Discussion 


As the maximum possible score was 
ten and the minimum score was ex- 
pected to be two or three, it was recog- 
nized from the start that there would 
be a narrow range of discrete measures. 
There were five Ss who were given 
scores of ten by both scorers, and ap- 
proximately one-half of the Ss had 
scores of eight orover. The “ceiling” 
on performance may have leveled the 
differences between the experimental 
treatments. The lack of discrimina- 
tion among superior Ss tends to add 
greater import to the highly significant 
differences obtained. 


TABLE 1 


Tue Numser or Concepts ATTAINED 

















Condition N Mean SD 
Short names 32 7.56 1.09 
Long names 32 5.90 1.40 
Drawings 32 7.80 1.13 
Photographs 32 7.88 99 
Manipulation 64 7.56 1.35 
No manipulation 64 7.01 1.42 
With shift 64 7.40 1.51 
No shift 64 7.17 1.29 














Type of symbol.—An analysis of 
variance (Table 2) indicates that one 
can state with high confidence that the 
type of symbols used to represent a 
number of familiar objects does in- 
fluence the efficiency with which 
college students can organize these 
objects into separate classes. Per- 
formance with “long names” is sig- 
nificantly inferior to that with “short 
names” (p <.01), and it is also 
significantly less than that with draw- 
ings or photographs (p <.01). But 
there is no convincing evidence that 
the “short names,” the line drawings, 























TABLE 2 
ANALYsIs OF VARIANCE 
Source df i. PF 

Symbols 3 | 27.8 | 21.4* 
Shift 1] 18 
Manipulation 1} 9.6 gue 
Symbols X shift 3| 0.01 
Symbols X manipulation 3} 16 
Shift X manipulation 1} 0.6 
Symbols X shift X manip- 

ulation 3} 2.3 
Residual 112} 1.3 
Total 127 

*Pm = 3.96. * Po» = 6.87. 


or the photographs have differential 
quantitative effects. 

In view of the results one might ten- 
tatively generalize that for college 
students the efficiency of conceptu- 
alizing is directly related to the level 
of abstraction of the verbal symbols for 
objects. Such a statement, based 
upon the direct relationship obtained, 
is contradictory to Hypothesis I, 
which proposes an inverse relationship 
between the level of abstraction and 
the number of concepts attained. 

There were, however, significantly 
more concepts formed with pictures 
than with “long names,” as predicted 
by this hypothesis. This result and 
the one previously discussed, there- 
fore, cannot be accounted for in terms 
of a single principle of generality of 
meaning (level of abstraction). In 
order to explain the obtained results a 
second variable might be considered. 
It is suspected that differences in the 
perception of words and pictures may 
have an effect upon the efficiency of 
their organization. 

Shift-—There is no evidence that 
the initial shift in this experimental 
situation had any quantitative effects 
upon the efficiency of conceptualizing. 
The difference in the number of con- 
cepts obtained between those who had 
to shift from five-card to three-card 
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categories and those who immediately 
began the organization of the smaller 
groups was not statistically significant. 

There are three interpretations of 
these results which may be considered. 
First, one might reject Hypothesis Ila, 
and tentatively accept the statement 
that a previous organization of per- 
ceived objects has little or no quanti- 
tative effect upon a subsequent con- 
ceptual organization of these same 
objects. Or secondly, one might say 
that some other factor in this experi- 
ment counteracted the inhibitory 
effects of shift. A possible factor 
which may have facilitated the per- 
formance of Ss who had to shift is 
transfer effect. The S finds or learns 
six classifications which are satisfac- 
tory to E, and he may note that objects 
are grouped together which have a 
common use, a common location, or a 
part in common, and he may apply 
these relationships when confronted 
with the main task which followed. 
Finally, some Ss may have considered 
the preliminary five-card grouping as 
a trivial diversion, as they already 
knew that their main task would be to 
find three-card groups. Such an atti- 
tude would be likely to reduce the 
effects of the shift. 

it was anticipated that some Ss in 
the nonshift group might immediately 
recognize the five-object categories and 
then would find it necessary to make 
the same shift as those in the other 
experimental group. To determine 
whether this had occurred, Ss were 
questioned at the end of the experi- 
ment, and 52 of the 64 Ss in this group 
reported that they had not observed a 
relationship among the five cards of 
any column. No S reported having 
recognized that every column was a 
related group in itself. It can be said, 
therefore, that generally the Ss of this 
experimental group did not have to 
make an initial shift. 


After half of the results had been 
collected and it was recognized that 
the shift was not having the predicted 
quantitative effects, E decided to 
record the time when each concept 
was recorded in order to determine 
whether the initial shift caused a delay 
in the attainment of the three-object 
categories. Since the total time was 
more than adequate for most Ss, it was 
considered possible that there were 
immediate inhibitory effects of shift, 
but that these effects could be over- 
come after a sufficient time. How- 
ever, with 34 Ss in the shift group and 
30 in the nonshift group, and by tabu- 
lating the number of concepts attained 
in each of the five 5-min. intervals, no 
significant evidence of an initial delay 
was obtained. 

Manipulation.—There is convincing 
evidence (significant at the .01 level) 
that the manipulation of the stimulus 
objects facilitated concept attainment 
(See Table 2). Hypothesis IIIa has 
been confirmed. Those Ss who were 
allowed to change the spatial config- 
uration of the cards and to isolate the 
completed groups were able to attain 
more concepts than those Ss who were 
not permitted to touch the cards. 

The effect of manipulation in this 
experiment is interpreted as a com- 
bined effect of two factors, changes in 
spatial arrangement and the isolation 
of completed groups. The more often 
the cards were moved about the board 
the more likely it was that S would see 
a relationship between adjacent cards. 
As the 30 cards on the large board 
could not be in S’s single span of 
attention, it would seem likely that 
the relationships between objects 
would be more readily discriminated 
when the three cards to be related 
were within a single span. The S who 
could not manipulate the cards was 
more dependent upon memory. How- 
ever, E observed that very few Ss 
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actually moved the cards about in an 
exploratory fashion. Many Ss in this 
group did not change the position of 
the cards until they had attained a 
concept. In fact, in spite of explicit 
instructions to put the cards adjacent 
to one another first and then to write 
the title of the group, many Ss wrote 
the title before they put the three 
cards together. Another common 
practice was to place two related ob- 
jects together and then to search the 
board for a third member of the group. 
These two practices would presum- 
ably tend to reduce the advantage of 
manipulation. 

There was behavioral evidence that 
the physical isolation of the completed 
groups did make it easier for Ss to 
concentrate on the remaining cards. 
The S who could not manipulate the 
cards had to keep referring back to his 
record sheet to determine which ob- 
jects had already been grouped. Sev- 
eral Ss who could not move the cards 
complained that they kept forgetting 
which cards they had already grouped. 

The interaction predicted between 
manipulation and the level of abstrac- 
tion of symbols was not evident. In 
fact, manipulation had a somewhat 
greater effect with verbal symbols than 
it did with pictorial ones, although the 
interaction was not statistically signifi- 
cant. This possible reverse relation- 
ship might well be tested in an experi- 
mental design which would permit a 
wider range of scores and which would 
have a higher “ceiling.” 

Variety of concepts used.—The great 
diversity of concepts given by Ss under 
each condition lends credence to the 
belief that the use or attainment of 
concepts is an active organizational 
process and not merely the recall of 
titles for fixed patterns. There were 
212 different combinations of three 
cards used, and in many cases for the 
same three cards different concepts 


were attained by different Ss. For 
example: the refrigerator, safe, and 
cupboard were called “things for stor- 
age” by some Ss, “rectangular arti- 
cles” by other Ss, and “objects with 
doors” by still others. The number 
of Ss who used each combination of 
objects was tallied, and in cases where 
clearly more than one concept was 
used for the same objects the repeti- 
tions of each concept were tallied sep- 
arately. Only one concept was shared 
by more than half the Ss. This con- 
cept was used by 68%. 

The difference in the mean number 
of concepts attained with names and 
pictures cannot be accounted for by a 
particular concept (or concepts) at- 
tained under one condition but not 
underanother. There was nocombina- 
tion of two concepts which accounted 
for half of the difference between“long 
names” and “short names,” or be- 
tween “long names” and the pictured 
symbols. 

For each kind of symbol—“short 
names,” “long names,” drawings, and 
photographs—there were more than 
100 different concepts used. The dif- 
ferences between these numbers of 
different concepts were not significant 
by the chi-square test. However, it 
is important to note that Ss working 
with verbal symbols attained a greater 
number of unique concepts than Ss 
working with pictorial symbols. A 
unique concept is defined as one which 
was attained only once during the 
course of the experiment. “Long 
names,” which gave the least number 
of concepts per S, gave the greatest 
number of unique concepts. There 
were 69 unique concepts with “long 
names,” 56 with “short names,” 43 
with drawings, and 43 with photo- 
graphs. For the null hypothesis 
x? = 8.8, and p < .05. This result 
suggests that the more difficult is the 
task of conceptual organization the 
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more diversified and original are the 
concepts attained by intelligent Ss. 

There is also evidence that Ss who 
were presented with different types of 
symbols tended to form different types 
of concepts. For example, more Ss 
who used the “short names” attained 
concepts based on common use than 
did those who used drawings. This 
statement is derived from an analysis 
of the most frequently attained con- 
cepts—those attained by eight or more 
Ss. The analysis was limited to these 
23 most frequently attained concepts 
because differences in rarely attained 
concepts would be less likely to be 
reliable, and because the analysis is 
exploratory rather than comprehen- 
sive. The classification of concepts 
was accomplished by first defining a 
number of types and then having the 
two scorers classify each of the 23 most 
commonly attained concepts. For 
example: when the refrigerator, safe, 
and cupboard were grouped together 
and called “things for storage,” this 
concept was classified as one based on 
common use. When for the same 
objects the title “rectangular boxes’’ 
was used, the concept was classified 
as one based on common shape. There- 
fore, the classification of concepts into 
types was not determined by the 
groups of objects themselves, but by 
the type of relationship indicated by S 
in his title. The scorers were able to 
classify the 23 most frequently at- 
tained concepts into five types: use, 
shape, parts, action, and “other” (one 
concept). Other types defined, such 
as common location and common ma- 
terials, were discarded. 

From the results of this classifica- 
tion one may tentatively accept the 
statement that there is a relationship 
between the type of symbols and the 
type of concepts attained.? The null 


2 A table indicating the number of subjects of 
each experimental group who attained each of 


hypothesis can be rejected at better 
than the .01 level of confidence. When 
grouping verbal symbols there appears 
to be a greater tendency to attain con- 
cepts based on common use than when 
grouping pictorial symbols. Andwith 
pictorial symbols, conversely, there is 
a greater tendency to attain concepts 
based upon common parts. Although 
these relationships may be a function 
of this particular group of objects, 
future Es should consider not only the 
effects of various symbols upon the 
efficiency of conceptualizing and think- 
ing, but also their effects upon the 
nature of the concepts formed. 


SUMMARY 


The aim of the experiment was to 
determine the quantitative effects of 
the level of abstraction of symbols, of 
a shift, and of manipulation upon the 
efficiency of concept attainment. The 
measure employed was the number of 
separate concepts attained by S. An 
analysis also was made of the specific 
concepts attained under each experi- 
mental condition. 

Four different sets of symbols which 
represented the “same” group of famil- 
iar objects at different levels of abstrac- 
tion were used. These were photo- 
graphs, line drawings, “long names,” 
and “short names.” The symbols in 
each set were presented simultane- 
ously and no memorizing was involved. 
Possible proactive inhibitory effects 
of a shift in conceptual organization 
were investigated by requiring some 
Ss first to label each column of five 
objects, and then to perform the main 


the twenty-three most frequently attained con- 
cepts, with concepts classified according to their 
type, and the definition of concepts may be 
obtained by ordering Document No. 3626 from 
the American Documentation Institute, 1719 N 
St. N. W., Washington 6, D. C., remitting $1.00 
for 35 mm. microfilm or $1.00 for photocopies 
(6 by 8 in.). 
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task—to find as many separate groups 
of three objects as they could. Fin- 
ally, some Ss were permitted to manip- 
ulate the cards with the symbols, i.e., 
to change their spatial relationships 
and to isolate completed groups. 

With a sample of 128 college stu- 
dents it was found that: 


1. The Ss attained more separate 
concepts with pictures of common 
objects than they did with the “long 
names” of these objects. 

2. They attained a greater number 
of concepts with “short names” than 
they did with the longer, more specific 
names. This result contradicted the 
hypothesis that the lower is the level 
of abstraction of the symbols to be 
grouped, the more concepts an indi- 
vidual will generally attain. 

3. There was no evidence that an 
enforced initial shift in conceptual 
organization had a quantitative effect 
upon the efficiency of conceptualizing. 

4. More concepts were attained 
when manipulation was permitted 
than when Ss did not move the 
symbols. 

5. The Ss tended to form different 
types of concepts with different types 
of symbols, and there was more diver- 


sification of concepts the more difficult 
the task of conceptual organization. 


(Manuscript received October 27, 1951) 
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STUDIES OF DISTRIBUTED PRACTICE: VII. LEARNING 
AND RETENTION OF SERIAL NONSENSE LISTS AS 
A FUNCTION OF INTRALIST SIMILARITY 


BENTON J. UNDERWOOD! 


Northwestern University* 


It is a well-established fact that the 
learning of lists of serial nonsense syl- 
lables is facilitated by distributed 
practice (e.g., 4, 8, 9). It has also 
been demonstrated that increasing the 
length of a list (6), increasing the rate 
of presentation of the items from 2 sec. 
to 1 sec., (4), or decreasing the mean- 
ingfulness (9), will tend to enhance 
the effects of distributed practice. 
Thus, it might seem that the more 
difficult the learning task the greater 
the probability that facilitation in 
learning will result from distributed 
practice. However, it has never been 
shown that this is a general law even 
within the restricted task of learning 
serial nonsense lists. That is, while 
increasing the length of the list or 
increasing the rate of presentation 
may increase the facilitation in learning 
by distributed practice, such a rela- 
tionship may be produced entirely by 
a factor or factors intrinsic to these 
two methods of changing the difficulty 
of the task. It is quite possible that 
when other variables are used to 
change the level of difficulty, the rela- 
tionship will not hold. 

One variable which should be effec- 
tive in changing the learning time of 
serial lists of constant length is intra- 
list similarity. The first purpose of 
the present experiments is to vary 
difficulty of serial nonsense lists by 


1Ross L. Morgan supervised the gathering 
of the data; E. James Archer is largely respon- 
sible for the statistical analyses. 

? This work was done under Contract N7onr- 
45008, Project NR 154-057, between North- 
western University and The Office of Naval 
Research. 


manipulating intralist similarity in 
order to observe whether facilitaf¥on 
by distribution and difficulty are di- 
rectly related. 

The second purpose of this investi- 
gation is to examine further Gibson’s 
(1) theory of verbal learning. One of 
Gibson’s predictions is that as intralist 
similarity increases, rate of forgetting 
increases. More specifically, Gibson 
holds that one important factor deter- 
mining rate of learning is intralist 
generalization. The greater the gen- 
eralization the slower the learning; 
differentiation among items must be 
established as an integral part of the 
learning process. Furthermore, Gib- 
son assumes that with the passage of 
time following learning, the generali- 
zation tendencies will recover spon- 
taneously to produce intralist inter- 
ference at recall. The greater the 
amount of intralist similarity (leading 
to generalization) the greater will be 
the rate of forgetting. A previous 
study by the present writer (10), using 
paired adjectives, has failed to confirm 
this expectation, and Gibson’s (2) own 
experiment (also using paired associ- 
ates) neither supports nor denies the 
theory as applied to retention. The 
present studies, using serial nonsense 
lists with different amounts of intralist 
similarity, will make an additional test 
of the Gibson hypothesis. 


PROCEDURE 


Materials.—All lists contained 14 items, but 
since the first item was used only as an antici- 
patory cue, S actually learned 13 syllables. For 
all lists, irrespective of level of similarity, the 
meaningfulness value was 46.67% to 53.33% as 
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determined by Glaze (3). Similarity was varied 
in terms of frequency of repetition of letters. 
The complete set of rules used to construct the 
lists will not be presented but the method em- 
ployed to vary intralist similarity must be indi- 
cated in some detail. 

Each syllable has two consonants and one 
vowel, the first and last letters being consonants. 


Similarity was varied in terms of frequency of’ 


repetition of first letters and frequency of repeti- 
tiom. of last letters. In the low-similarity lists 
there was no repetition of first letters and no 
repetition among last letters. The repetition 
among first and last letters was as low as possible 
within the limits imposed by the number of 
letters available. In the medium-similarity lists 
seven different letters were used to start a syl- 
lable, each one occurring twice. The same was 
true among the last letters but there was no 
overlap among the letters used as first and last 
letters. In the high-similarity lists four letters 
were used three times each as the first letter and 
one was used twice. Using different letters, the 
same rules held for the last letter of the syllables. 
Vowels remained relatively constant for all levels 
of similarity, each being used about equally often 
in each type of list. 


For each level of similarity three 
lists were constructed. These are 
shown in Table 1. A given S learned 
only the lists within a given degree of 
similarity so that the same syllables 
might be used for more than one level 
of similarity. All lists were presented 
by a Hull-type drum at a 2-sec. rate. 


Design.—Three independent experiments were 
performed. Each experiment dealt with a dif- 
ferent level of similarity and differed from the 
others only in that respect. Within each experi- 


ment there were three experimental conditions, 
differentiated by the length of the intertrial rest. 
In one condition 2 sec. elapsed between each trial 
(massed learning). In the other two conditions 
(distributed practice) the intertrial rests were 30 
and 60 sec. 

With three conditions and three lists, both 
lists and conditions were counterbalanced by 
using 36 Ss. The advantages of this design 
have been detailed elsewhere (11). 

A common practice list was used for each 
group of 36 Ss. On the practice day S learned 
this list to a criterion of seven correct responses 
on a single trial by massed practice, and then 
learned to a criterion of one perfect recitation 
with a 30-sec. rest between each trial. The 
activity used to fill this distribution interval, as 
well as those on the experimental days, was sym- 
bol cancellation. This task has been described 
previously (11). 

On the experimental days the list was learned 
to a criterion of one perfect trial. Twenty-four 
hours later the list was recalled and relearned to 
one errorless trial. Relearning was by massed 
practice in all conditions. Relearning of one list 
and original learning of another took place at a 
single session. 


RESULTS 


Practice list—Evaluation of prac- 
tice-day performance will show the 
equality of the three groups of Ss. 
Table 2 gives the mean number of 
trials required to learn the practice 
list to one perfect trial, along with the 
mean number of errors per trial. Anal- 
ysis of variance has shown that the 
within-groups variance is larger than 
the between-groups variance for both 
trials to learn and errors per trial. It 





TABLE 1 
EXPERIMENTAL Lists 


Low Similarity 


VOD IR 


Medium Similarity High Similarity 


QYT YUG MIC HEF KIV GEZ DUF 
HAX YB SOZ NOL SUJ KAM FUW WIQ NOK 
NIW KON FAH JEP XOW TIS XIR TEZ VAY 
AL FEG NUY RUH ZET POB JUW BIH HEF 
IQ VAP LER YAX VOK WUS YUL WEP DAK 
FEC BIH VAQ ZEP XAN KOR FEC TA QIxX 
BA WUS DYP FAH GIJ WEM KAW MI VEB 
LO. HEX IS COQ DAF YOR BAJ DUY 
KEM CAZ UJ JEV SOZ LUB JAL MEP VOK 
RYG SIQ TOF ZUL BEK POY FOV GIJ HEB 
XUN MYD KIV RIQ VUT HAX IR NUY 
RUK GAC CEG QOZ DUY EL WIH HAX 
WEP NAS YOB FAX MUN TOF KOC MAQ NEF 
DUF POV XEL NUV GAC LER JEV BUP QIB 








TABLE 2 


PERFORMANCE OF THE GROUPS ON THE 
Common Practice List 








Trials Errors 
to Learn Per Trial 
Group 





M oM M oM 


Low-Similarity 29.69} 1.38 | 1.51 | .14 
Medium-Similarity | 28.75] 1.83 | 1.55 | .13 
High-Similarity 29.75 | 1.96 | 1.47 | .12 














may be concluded, therefore, that the 
groups were initially comparable. 

Trials to learn.—The basic results in 
terms of the mean number of trials 
required to learn to a criterion of one 
perfect trial with the three levels of 
similarity and the three intertrial rest 
intervals are shown in Fig. 1. 

Three facts, evident from these 
curves, are confirmed by the statistical 
analysis: (a) As intralist similarity 
increases, the mean number of trials 
to learn increases. (b) Learning is 
faster under the two distributed con- 
ditions than under the massed condi- 
tion in all experiments. The F is 6.16, 
and for 2 and 200 df an F of 4.71 is 
needed for the 1% level of confidence. 
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Fic. 1. Learning of serial nonsense lists as a 
function of intralist similarity and intertrial 
rest. 
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(c) The third fact, not clearly appar- 
ent in Fig. 1, is that there is little inter- 
action between intertrial rest and intra- 
list similarity (F is 1.06). Thus, while 
an increase in intralist similarity 
greatly increased the difficulty of 
learning, the learning of the more 
difficult lists is no more influenced 
by distributed practice than the less 
difficult.’ 

Errors during learning.—It was ex- 
pected that as intralist similarity in- 
creased, overt errors per trial would 
increase. However, the statistical 
analysis gives no basis for concluding 
that either intralist similarity or inter- 
trial rest is a significant source of 
variance as far as errors are concerned. 
Thus, wide differences in learning 
appear as a function of intralist simi- 
larity but no differences in error fre- 
quency. Actually, the data show that 
a somewhat greater number of errors 
was made in learning the medium- 
similarly lists than in learning the low, 
but the medium- and high-similarity 
lists produced about the same error 
frequency. For all conditions of inter- 
trial rest combined, the mean number 
of errors per trial made in learning the 
low-similarity lists was 1.21. For 
medium-similarity the value was 1.45, 
and for high, 1.49. 

Recall—The complete statistical 
analysis for the recall scores is shown 
in Table 3. In this analysis the appro- 
priate error term for testing similarity 
is Ss/Similarity. For testing the 
other main effects the proper term is 
Pooled Ss X Practice/Similarity. For 
evaluating list differences within a 


3 The complete statistical analysis of the learn- 
ing scores will not be presented. The method 
used is shown in detail in Table 3 where the 
recall scores are analyzed. One additional find- 
ing of the present analysis may be noted. The 
interaction between learning-how-to-learn and 
similarity was significant at about the 5% level of 
confidence, with the practice effect being greatest 
for the high-similarity lists. 
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TABLE 3 
ANALysis OF VARIANCE OF RECALL ScoRES 
Source of Variation df Mean | pe 
are Square 
Independent observations 
Similarity 2| 15.9383) 1.280 
Ss/Similarity 105} 12.4526) 
Total between Ss 107| 12.5177 
Correlated observations 
Practice 2| 14.7809] 3.065 
Practice X Similarity 4} 12.4059] 2.573 
Intertrial rest 2) 33.2994) 6.905 
Intertrial rest X Similarity 4} 8.0494) 1.669 
Lists/Similarity (pooled) 6| 3.6420 
Lists/Similarity,, 2| 0.7778) — 
Lists/Similarityy.4. 2| 8.0371) 2.008 
Lists/Similarityqig, 2) 2.1112) — 
Pooled Ss X Practice/Similarity 198) 4.8224 
Ss X Practice/Similarity,,,,, 66| 5.6970 
Ss X Practice/Similarityy,4. 66| 4.0025 
Ss X Practice/Similarity gig, 66| 4.7677 
Total within Ss 216) 5.3457 
Total for experiment 323) 7.7216 














* With 2 and 200(198) df, F at .05 level is 3.04, and 
at .01, 4.71. With 4 and 200(i98) df, F at .0S is 2.41, 
and at .01, 3.41. With 2 and 66(65) df, the .05 level 
is 3.14, and the .01 level is 4.95. 


given level of similarity, the appro- 
priate term is Ss X Practice/Similarity 
for that particular level. The two 
most pertinent facts in Table 3 are 
that intralist similarity is not a signi- 
ficant source of variance at recall while 
intertrial rest during original learning 
is. Furthermore, with regard to the 
influence of intertrial rest, the data 
show that recall is better following 
massed than following distributed 
practice. For all experiments com- 
bined, the mean number of items 
recalled 24 hr. following massed learn- 
ing was 5.00. For the 30-sec. dis- 
tributed condition the corresponding 
value was 4.19, and for the 60-sec. 
condition, 3.94. Clearly, then, in 
these experiments, massed learning is 
followed by better recall than is dis- 
tributed learning. 

Several other analyses of the recall 
scores have been made to determine 
exactly on which items massed prac- 


tice was superior. The first analysis 
concerns the relationship between 
number of correct anticipations (rein- 
forcements) during learning and the 
per cent correct at recall. We know 
from the facts of the bowed serial posi- 
tion curve that there are wide differ- 
ences in the frequency with which each 
item is anticipated correctly during 
learning. It is also possible that the 
probability of an item being correct at 
recall is related to its position in the 
list independent of the number of 
reinforcements. Therefore, if we are 
to compare per cent correct at recall 
for different intertrial rest conditions 
as a function of number of reinforce- 
ments, we should first ascertain that 
the shapes of the bowed acquisition 
curves during learning are comparable 
for the different intertrial rest condi- 
tions. This has been done and it can 
be categorically stated that the shapes 
of the three curves (when number of 
correct anticipations is plotted against 
serial position) are highly similar. In 
plotting thesecurves differences in lists 
due to similarity were equally repre- 
sented in all three intertrial rest con- 
ditions, i.e., the three experiments were 
combined. It is therefore reasonable 
to proceed with an item analysis of 
recall as a function of intertrial rest 
during learning with confidence that 
there is little if any position bias con- 
founding the analysis. 

In making the item analysis for 
recall, the total number of correct 
anticipations for each item during 
learning was tabulated along with 
whether or not the item was correct at 
recall. For each of the intertrial rest 
conditions there was 1404 items (13 
items per list X 36 Ss X 3 conditions 
of similarity). These have been cate- 
gorized as items having 1-7 reinforce- 
ments, 8-11, 12-15, 16-20, and 21- 
and-over reinforcements. The num- 
ber of items in each category is in no 











SOF Fd 4 


P 


40} 


30F 





INTER-TRIAL 


20} INTERVAL | 


J 
4 
< 
nd 
Ww 
« 
ke 
< 
i 
1S) 
Ww 
a 
a 
° 
Vv 
i 
z 
ud 
i) 
a 
w 
a 





1-7 8-1) 12-15 = 16-20 2toven 
NUMBER OF REINFORCEMENTS 








-Fic. 2. Percentage correct at recall as a func- 
tion of intertrial rest and number of reinforce- 
ments during learning 


case less than 223. As an illustration 
of the computation: there were 251 
items in the massed condition (2-sec. 
rest) having 1-7 reinforcements. Of 
these 251 items, 42 (16.7%) were antic- 
ipated correctly on the first recall trial. 

Making the above calculations for 
each category of reinforcement for 
each condition of intertrial rest allows 
the plotting of the per cent correct at 
recall as a function of the two vari- 
ables. Such a plot is shown in Fig. 2. 
The one clear fact demonstrated by 
these curves is that recall under the 
massed condition is largely superior 
only in the category where the number 
of reinforcements during learning is 
greatest (21 and over). It will be 
noted that the recall of items in this 
category is over 55% for the massed 
conditions and about 40% for the 
distributed. It might become appar- 
ent at once that this difference could 
be an artifact of the categorization 
procedure. That is, since learning 
under massed practice took longer 
than learning under distributed prac- 
tice, more items having 21l-and-over 
reinforcements would be counted for 
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the massed conditions than for the 
distributed. Furthermore, the mean 
number of reinforcements in this cate- 
gory should be greater for massing 
than for distribution. If there is a 
direct relationship between number of 
reinforcements and per cent correct at 
recall, it would be expected that the 
massed condition would result in better 
recall for items in this category. 
Further analysis shows, however, that 
this is not the case. We have evalu- 
ated only those items in the 2]l-and- 
over category by groupings of two’s, 
i.e., 21-22, 23-24, 25-26, etc. For 
each of these small groupings the per 
cent correct at recall was calculated, 
and in each case recall following mass- 
ing was equal to or superior to recall 
following distribution. It must be 
concluded that the better recall follow- 
ing massed practice for highly rein- 
forced items is not an artifact of the 
grouping procedure but represents a 
true phenomenon. 

We have shown above that the dif- 
ference in recall following massing and 
distribution is largely limited to items 
which have been heavily reinforced. 
By the very nature of the bowed acqui- 
sition curve such items must be near 
the first or near the last part of the 
list. In Fig. 3 we have plotted the 
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absolute number of items recalled for 
each serial position. It is apparent 
that the massed condition obtains 
superiority in recall over the distrib- 
uted conditions largely for items in the 
early portion of the list. 

Relearning.—The mean number of 
trials required to relearn to one perfect 
trial under all conditions is shown in 
Fig. 4. There are three facts of note 
exhibited in Fig. 4: (a2) The greater the 
similarity the greater the number of 
trials required to relearn. The F 
(6.83) is significant well beyond the 
1% level of confidence. (6b) Relearn- 
ing is faster following massed than 
following distributed practice (F is 
9.48). (c) While the curves suggest 
some interaction between similarity 
and intertrial rest, the F is not signifi- 
cant (1.61). 


Discussion 


Little need be said about the learning of the 
lists. As expected (from previous investigations) 
distributed practice facilitated the learning of 
serial nonsense lists. It may be noted, however, 
that no evidence was presented which showed 
that 60 sec. between trials was consistently supe- 
rior to 30 sec. between trials; in two of the experi- 
ments the longer interval facilitated more than 
the shorter, in the other less. There was no 


interaction between intralist similarity and inter- 
trial rest. This means that difficulty of learning 
may be increased without a corresponding in- 
crease in the facilitation produced by distributed 
practice. The differences in trials to learn for 
the different levels of similarity were clear-cut; 
approximately 35% more trials were required to 
learn the high-similarity lists than the low. Thus, 
while it is possible that with greater extremes in 
difficulty (produced by similarity) distribution 
would have a differential effect on learning, there 
is nothing in our data to suggest it. When dif- 
ferences in difficulty produced by the manipula- 
tion of variables other than intratask similarity 
have been shown to produce interaction (4, 6, 9), 
the causal agents may be specific to the variable. 
The other alternative is that interaction between 
difficulty and facilitation by distributed practice 
can only be demonstrated with distribution inter- 
vals longer than those employed here. 

Trials to learn increased directly as a function 
of intralist similarity; errors per trial in learning 
did not. While the medium- and high-similarity 
lists produced more errors than did the low- 
similarity lists, evaluation of the error data as a 
whole gave us no basis on which to state a defini- 
tive relationship. It is our belief that there is a 
direct relationship between similarity and error 
tendencies but that S will only “tolerate” so many 
overt errors. When a maximum is reached, 
further increase in errors may not be expected. 

The findings have demonstrated that recall is 
better following massed practice than following 
distribution. The previous experiment most 
comparable to the present study is that of 
Hovland (5). Using serial nonsense lists and a 
2-min. distribution interval, he found recall to be 
better following distributed practice. Hovland’s 
finding was one of the major empirical contradic- 
tions to the theory developed by Hovland, Hull, 
and others (7). The portion of the theory which 
deals with the present problem postulates a form 
of inhibition which generates during learning and 
dissipates with rest. (In its essential properties 
this inhibition is very comparable to that called 
reactive inhibition.) The effective excitatory 
strength between a stimulus and a response at 
any given moment is the algebraic sum of the 
positive tendency (learning) and the inhibition. 
According to the theory, during massed learning 
there would be little opportunity for the inhibi- 
tion to dissipate; hence, the learning component 
would be considerably greater than the perform- 
ance would indicate because it was being counter- 
acted by the inhibitory component. In distrib- 
uted practice, on the other hand, inhibition would 
remain fairly well dissipated during acquisition. 
Such a theory, of course, gives a fairly straight- 
forward accounting of the fact that learning is 
faster under spaced than under massed practice. 
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However, the theory must also predict that reteri- 
tion following massed practice will be better than 
following distributed practice. This follows from 
the fact that given a rest interval the inhibition 
accruing during massed learning will have dissi- 
pated, thus leaving the greater learning com- 
ponent developed under massed practice to 
demonstrate itself in better recall performance. 
Hovland, as mentioned earlier, found the opposite 
result. This contradiction to the theory was 
never satisfactorily resolved. 

The present results would seem (on the sur- 
face) to confirm the Hull-Hovland theory. How- 
ever, automatic confirmation of their theory by 
the present results does not necessarily follow. 
We have shown that the better recall under 
massed, as compared with distributed, practice 
occurs with items having many reinforcements. 
Yet, it is not clear in our data whether this differ- 
ence is produced because the recall of the massed 
items is facilitated or because the recall of the 
distributed items is inhibited. By the Hull- 
Hovland theory, the massed items should be 
facilitated. ~However, Fig. 2 more strongly sug- 
gests that the distributed items which have been 
heavily reinforced are inhibited. For, we may 
commonly expect some direct (though not neces- 
sarily linear) relationship between number of 
reinforcements during learning and per cent cor- 
rect at recall (10). Figure 2 shows that under 
distributed conditions items with 2l-and-over 
reinforcements are recalled no better than those 
with 16-20 reinforcements. Thus, it appears 
that the items which are most heavily reinforced 
are being inhibited in some fashion following 
distributed learning. Nevertheless, since we do 
now know the “true” relationship between num- 
ber of reinforcements and recall, there is no way 
by which this issue can be satisfactorily settled 
from analyses of the present data. 

Another approach to the problem is to con- 
sider the first recall trial to be a relatively unstable 
index of learning. We have noted that the recall 
difference following massed and distributed learn- 
ing occurs primarily during the first part of the 
recall trial. This fact may indicate the operation 
of a very temporary inhibiting factor following 
distributed learning. During distributed learn- 
ing, symbol cancellation came between each trial; 
hence, recall on each trial was always preceded 
by this activity. Before recall, no symbol can- 
cellation was given for any condition. This 
change in context between learning and recall for 
the distributed conditions may have been suffi- 
cient to disrupt temporarily the recall perform- 
ance. Such a possibility, of course, is subject to 
experimental test. 

While the interpretation of the recall results 
must remain somewhat inconclusive, the relearn- 
ing results are clearly in accord with the Hull- 


Hovland theory. Since the S-R associations 
formed by massed practice are actually stronger 
than those formed under distribution (according 
to the theory), relearning would be expected to 
occur more rapidly following massed learning. 
This was found in the present results; indeed, the 
massed list was relearned some 25% more rapidly 
than the distributed lists. In short, one cannot 
deny the usefulness of the inhibition theory in 
handling the relearning data; it is unfortunate 
that such clear-cut results (for or against the 
theory) were not found for the recall scores. The 
fact that Hovland found quite the opposite 
results from those presented here when he used a 
2-min. distribution interval indicates, however, 
that the inhibition theory is inadequate in its 
present form to account for the facts of retention 
as they are known at the present. 

Finally, the relation of the retention results 
to Gibson’s theory (1) may be indicated. The 
recall results do not support the Gibson hypoth- 
esis; recall did not vary as a function of similarity. 
This squares with previous findings using paired 
adjectives (10). Gibson’s theory states that 
generalization tendencies recover with the pas- 
sage of time following learning to produce once 
again the interference which resulted in differ- 
ences in original learning. The greater the orig- 
inal generalization (similarity), the greater the 
interference and the poorer the recall. However, 
the relearning scores do support the hypothesis; 
a greater number of trials was required to relearn 
the high-similarity lists than the low. And, 
although we did not present the data, error 
analyses show more errors per trial were made in 
relearning the medium- and high-similarity lists 
than in relearning the lists of low similarity. As 
during original learning, the medium- and high- 
similarity lists could not be differentiated in terms 
of errors during relearning. Thus, the relearning 
data support the Gibson theory; the recall results 
do not. We are unable to tell from our present 
results whether this latter failure is due to inade- 
quacies in the theory or to inadequacies of the 
recall scores as measures of retention. 


SUMMARY 


Three experiments were run to deter- 
mine: (a) the adequacy of the Gibson 
theory of verbal learning for the 
acquisition and retention of serial lists 
of nonsense syllables of varying degrees 
of intralist similarity, and to deter- 
mine (b)if increasing difficulty in 
learning produced by increasing intra- 
list similarity results in a correspond- 
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ingly greater facilitation in learning by 
spaced practice. Each experiment 
differed only in terms of intralist sim- 
ilarity of the nonsense syllables. 
Within each experiment three inter- 
trial rest intervals were used, namely, 
2, 30, and 60 sec. A separate group 
of 36 Ss served in each experiment 
with the three intertrial rest conditions 
and the lists being counterbalanced 
within each experiment. Under all 
conditions the list was learned to one 
perfect trial, and following a 24-hr. 
rest was relearned to one perfect trial 
by massed practice. 

The results show: 


1. Significant differences in rate of 
learning as a function of intralist sim- 
ilarity, with no consistent relationship 
between errors per trial and intralist 
similarity. 

2. No interaction between intralist 
similarity and intertrial rest. Dis- 
tributed practice resulted in faster 
learning than did massed, but this 
facilitation was no greater for the 
high-similarity lists than it was for the 
low. 

3. Recall did not vary as a function 
of intralist similarity but did vary asa 
function of intertrial rest during learn- 
ing, being greater following massed 
than following distributed practice. 
The superiority of recall following 
massed learning was found to be con- 
fined largely to heavily reinforced 
items in the early part of the list. 

4. Relearning was much more rapid 
following massed practice than follow- 
ing distributed practice, and was more 
rapid for the low-similarity lists than 
for the high. 

The recall results do not support the 
Gibson theory; the learning and re- 
learning data do. The retention re- 
sults (both recall and relearning) are 
in direct contradiction to previous 
findings of Hovland using a 2-min. 


distribution interval but are in fair 
accord with his inhibition theory. 
While no completely satisfactory ex- 
planation of the recall results was 
presented, they do not negate the 
Hovland theory and the relearning re- 
sults definitely support the hypothesis. 


(Manuscript received October 29, 1951) 
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EFFECT OF AMOUNT AND DISTRIBUTION OF 


WARMING-UP ACTIVITY ON RETENTION 
IN MOTOR LEARNING! 


REUBEN J. SILVER 
Mental Hygiene Clinic, Fergus Falls, Minn. 


It has been found by Ammons (1) 
and several other investigators that 
initial learning in certain activities is 
impeded by the lack of proper set. 
Learning is then facilitated by the 
acquisition of this set during the first 
few practice trials. Similarly, when 
practices ceases for a period of time, 
e.g., when a rest interval is introduced, 
there occurs a loss of the perceptual- 
motor adjustments which had been 
acquired during the performance of 
the task. This set must then be rein- 
stated when practice is resumed. 

The decrement in learning which is 
brought about by the lack of proper 
set and which is overcome during the 
initial learning or relearning trials 
is sometimes referred to as warm-up 
decrement. While investigating the 
characteristics of the motor learning 
curve following the introduction of a 
rest interval, Ammons hypothesized 
that warm-up decrement should be a 
negatively accelerated function, both 
of the amount of prerest practice and 
the length of the interpolated rest. 
Using the pursuit rotor he (2) was 
able to substantiate these two predic- 
tions, and further, found that warm- 
up decrement reached its maximum 
after about 8 min. of work and after 
about 10 min. of rest. 

A different approach to the problem 


1 This paper is based on part of a dissertation 
submitted in partial fulfillment of the require- 
ments for the Ph.D. degree at New York Univer- 
sity. The author wishes to express his gratitude 
to Professor L. W. Crafts for his valuable aid and 
advice throughout the progress of the study. 
Thanks are also due to Dr. Hilton N. Wasserman 
for his valuable suggestions. 


has been initiated by Irion (4). He 
reasoned that filling the rest interval 
with a set maintaining activity should 
prevent S from losing the perceptual- 
motor adjustments acquired during 
the performance of the task, and con- 
sequently, retention should be im- 
proved. One of his experiments (5) 
suggests that this is true. Using non- 
sense syllables as the learning task and 
48 hr. as the rest interval, he had his 
Ss engage in 0, 1, 5, and 10 trials of 
warming-up activity. The warming- 
up activity was introduced immedi- 
ately before the recall period, and 
consisted of the recitation of three- 
place digits which were presented in a 
manner similar to that in which the 
nonsense syllables had been presented. 
The results of this experiment showed 
that retention increased up to the 
point of five warming-up trials but 
decreased when ten warming-up trials 
were used. 


To explain this finding, it is advantageous to 
employ the concept of reactive inhibition (Jz). 
Hull postulates (3) that once a response is made, 
there is built up in the organism a tendency to 
avoid its repetition. This negative drive is called 
reactive inhibition and is a positive function of 
the amount of work involved in the response in 
question. Further, since reactive inhibition is a 
drive whose goal is cessation of response, it dis- 
sipates during a rest interval. In this way, one 
may explain the advantage of introducing a rest 
interval during learning, or more generally, the 
superiority of spaced practice. 

Hull further indicates that Jz probably is an 
effector phenomenon and does not involve the 
entire sensorimotor arc. Therefore, if the learn- 
ing task and the warming-up activity are similar 
from the point of view of the effectors involved, 
the Jz produced by both should be similar (in 
terms of the effectors involved), and thus, as 
provided by Hull’s theory, it is probable that 
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these two negative drives will summate with each 
other. That is, if approximately similar effec- 
tors are involved, it is justifiable to assume that 
the Jz produced by the learning task will sum- 
mate with the Jz produced by the warming-up 
activity. 

Bearing this in mind we may go a step further: 
Immediately following the cessation of practice, 
the Jz accumulated during the practice period 
should begin to dissipate. With rest, however, 
another tendency results, namely, the loss of set. 
In many instances -the beneficial effect of the 
dissipation of Jz is more than enough to outweigh 
the decrement resulting from the loss of set. Con- 
sequently, reminiscence often is observed. 

With the introduction of the warming-up 
activity, two opposing effects again result: rein- 
statement of set, and the accumulation of Jp 
for the warming-up activity. If the results of 
Kimble (7) concerning the rapidity with which 
Ip accumulates are correct, the effects resulting 
from the reinstatement of set may be unobserved 
because of the additional 7g developed while 
engaging in the warming-up activity. Thus an 
apparent paradox is explainable as follows: 
Warming-up activity aids retention as long as 
the benefits accruing from the reinstatement of 
set are greater than the detrimental effects of the 
accumulation of additional Jz. After a certain 
point, all the perceptual-motor adjustments lost 
during rest are regained, and therefore, further 
warming-up activity can result only in the ac- 
cumulation of additional Jz. Since the Jz pro- 
duced by the warming-up activity can summate 
with the Jz resulting from the performance of 
the learning task, the result should be a decrease 
in performance. 

The results of part of an experiment by 
Kientzle (6) tend to support this hypothesis. 
Using the alphabet printing task, she had one 
group of Ss practice 60 sec. and rest 60 sec. The 
entire interval was spent in making tallies and 
crossbars. It would be expected that with a 
60-sec. work period the optimal amount of warm- 
ing-up activity would be quite small. That is, 
it would be expected that 60 sec. of warming-up 
activity would be greater than the optimal 
amount. This inference is supported by Kient- 
zle’s results, which indicate that retention was 
impaired when compared with a similar group 
who did nothing during the rest interval. 

The significance of this discussion for the 
present experiment may be summarized as 
follows: 

First, there should be an optimal relationship 
between the amount of prerest practice, the 
amount of warming-up activity, and retention. 
It has been indicated that when a rest interval 
has been introduced, S loses some of the per- 
ceptual-motor adjustments acquired during the 


performance of the task. The greater the amount 
of prerest practice, the greater is the number of 
adjustments acquired, and susceptible to loss 
during rest. Therefore, the amount of warming- 
up activity necessary to reinstate set should be 
an increasing function of the amount of prerest 
practice. 

Secondly, although warming-up activity rein- 
states set, it also results in the accumulation of 
additional Jz. Preventing S from accumulating 
Iz during the warming-up activity should result 
in superior retention, and this should hold true 
even if the entire interval is filled with the warm- 
ing-up activity. The prevention of the accum- 
ulation of Jz can easily be accomplished by 
distributing the warming-up activity. On the 
other hand, if the warming-up activity is massed, 
this should produce a decrement in performance 
of the learning task when practice is resumed. 
This assumes that the Jpg produced by the 
warming-up activity and that produced by the 
learning task summate, provided both activities 
are similar. 


The above discussion leads to three 
predictions which the present experi- 
ment was designed to test: (a) The 
amount of warming-up activity neces- 
sary to reinstate set should be an 
increasing function of the amount of 
prerest practice. (b) With increasing 
amounts of massed warming-up activ- 
ity, retention should increase up to a 
point and then decrease. (c) With 
increasing amounts of distributed 
warming-up activity, retention should 


increase up to a point and then level 
off. 
MeEtTHOD 


Subjects —The Ss were 24 groups of 20 stu- 
dents from classes in general psychology at New 
York University, or City College of New York. 
The experiment was administered to one group at 
atime. Since the data are analyzed by means 
of the analysis-of-variance technique, it is essen- 
tial that the number of Ss in each group be equal 
(10). Although the experiment was administered 
to classes of unequal size, the groups were equal- 
ized by random selection. 

Materials.—The learning task was the inverted 
alphabet task described by Ruch and Warren (8) 
and discussed by Kimble (7). This task requires 
S to print the letters of the alphabet upside down 
and from right to left in such a manner that when 
the paper is rotated 180° the letters appear 
properly written. 
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Procedure.—In Part I of the experiment, Ss 
were divided into three groups, each having a 
different amount of prerest practice. The Ss in 
Group 1 received a 10-min. rest after Trial 5; 
those in Group 2 after Trial 15; and those in 
Group 3 after Trial 30. 

The Ss in each group were divided into six 
subgroups receiving 0, 4, 1, 14, 2, and 24 min. 
of warming-up activity during the rest interval 
and immediately prior to the resumption of prac- 
tice. The warming-up activity required S to 
make tallies and crossbars on a worksheet sim- 
ilar to that used for the alphabet printing task. 

The group receiving 15 trials of prerest prac- 
tice included three additional subgroups, which 
had 3, 4, and 5 min. of warming-up activity, 
respectively. This procedure was utilized to 
facilitate the comparison of the results with those 
of Part II of the experiment. 

The classes constituting the groups having 
various amounts of warming-up activity in each 
condition were assigned at random. This pro- 
cedure was designed to help assure the equival- 
ence of the groups. 

Each S was given two booklets, one for the 
experiment proper and the other for the warming- 
up activity. The one for the experiment proper 
consisted of four work sheets and a title page. 
Each work sheet was set up in such form that S 
was required to print each letter in a }-in. square 
box. Each sheet was divided into nine trials. 
The booklet for the warming-up activity was 
similar to the one used in the experiment proper. 

Each of the experimental groups received 36 
30-sec. trials and a single restof 10min. Follow- 
ing Kimble (7), Ss were told after 29 sec. to 
proceed to the next trial. This same procedure 
was employed even on Trials 9, 18, and 27, after 
which the page had to be turned. The score 
was the number of letters printed, accuracy 
being disregarded. 

Part II of the experiment was essentially sim- 
ilar to the first. Amount of prerest practice, 
however, was held constant at 15 trials. Three 
groups of Ss engaged in 3, 4, and 5 min. of dis- 
tributed practice on the warming-up activity 
during a 10-min. rest, and just prior to the 
resumption of practice. The Ss alternately 
worked 30 sec. and rested 30 sec. during the 
warming-up period. The group that had 5 min. 
of warming-up activity began this practice at 
the start of the rest period. The other groups 
began the warming-up activity after 2 and 4 
min. of the interval had elapsed. 


REsuULTs AND Discussion 


Warming-up activity and prerest prac- 
tice.—All groups were equivalent on 
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Fic. 1. The mean increase in score on the 
trial following various amounts of warming-up 
activity after 5, 15, and 30 trials of prerest 
practice. 


comparable prerest trials.2 The mean 
increase in score on the first postrest 
trial for the 18 groups having various 
amounts of warming-up activity is 
shown in Fig. 1. It is apparent that 
after rest the performance of the Ss 
in the different groups increased up to 
a point, as the amount of warming-up 
activity increased, and then declined. 
The point after which performance 
began to decline is a function of the 
amount of prerest practice. 

In order to determine whether the 
trends depicted in Fig. 1 are signifi- 
cant, an analysis of variances was per- 
formed (Table 1). The F test shows 
the variance due to warming-up activ- 
ity to be significant at the .01 level. 

To evaluate these results further, 
the significance of the differences be- 
tween the increase in score for the 
groups having rest and warming-up 
activity and those having rest without 
warming-up activity was computed. 
The t¢ tests are summarized in Table 2. 
With five trials of prerest practice, 4 
min. of warming-up activity yields the 
maximum retention score. After 30 
trials of prerest practice, performance 
is significantly increased following 1} 
min. of warming-up activity. Al- 
though none of the differences in 


2 An analysis of these prerest scores is pre- 
sented in (9). 
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TABLE 1 


ANALYSIS OF VARIANCE OF THE MEAN INCREASE 
1n SCORE ON THE TRIAL FoLLowinc Various 
Amounts or WarmiNG-uP AcTIVITY 
AFTER 5, 15, anp 30 TRIALS OF 
Prerest PRACTICE 











Source of Variance a df F 
Practice 13.85 2 1.00 
Warming-up activity | 469.23 5 5.49* 
Interaction 262.32 10 1.54 
Error 5842 342 
Total 6587.40} 359 














* Significant at the .01 level of confidence. 


Cond. 15 is significant, the trend is 
similar and suggests that the maximum 
retention is found following 1 and 1} 
min. of warming-up activity. 

The relationship between warming- 
up activity and the amount of prerest 
practice is indicated more clearly in 
Fig. 2. Each point on the curve shows 
the amount of warming-up activity 


ad 


WNUTES OF WARMUING.UP acTiVITY 








s 's 
NUMBER OF PRE-REST PRACTICE TRIALS 


Fic. 2. The amount of warming-up activity 
which produced the maximum relearning score 
after 5, 15, and 30 prerest trials. 


following 5, 15, and 30 trials of prerest 
practice which yields the greatest in- 
crease in score on the first postrest 
trial. It is apparent that the amount 
of warming-up activity which pro- 
duces the greatest increase in score is 
an increasing function of the amount 
of prerest practice. 


TABLE 2 


Mean DirrErRENCE IN INCREASE IN SCORE AFTER REST BETWEEN THE Groups Havinc 
Warminc-Up Activity AND THOSE Havinc No Warminc-UP Activity 

















Amount of Amount of 
Prerest actin antine 2 Ain.) Mean SD ra ” 
0 3.70 2.26 
4 6.90 4.52 3.20 2.52** 
5 Trials 1 4.30 2.59 50 <1.00 
1} 5.05 5.01 1.06 1.06 
2 5.30 3.02 1.26 1.26 
24 3.70 4.36 — 1.65 1.65 
0 4.00 3.49 
. } 5.00 3.15 1.00 <1.00 
15 Trials 1 5.95 4.40 1.95 1.54 
1} 5.80 4.18 1.80 1.42 
2 3.60 3.83 —.40 <1.00 
24 4.30 5.78 30 <1.00 
0 3.80 4.43 
} 4.85 4.39 1.05 < 1.00 
30 Trials 1 6.20 3.46 2.40 1.89 
lk 7.75 4.13 3.95 3.11%" 
2 4.40 3.65 0 <1.00 
24 2.70 3.18 —1.10 < 1.00 




















*?¢ values are based upon the error variance of the analysis of variance in Table 1. 


** Significant at the .02 level of confidence. 
*** Significant at the .01 level of confidence. 
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These results imply that the number of per- 
ceptual-motor adjustments increases as the 
amount of prerest practice increases. Since late 
in learning more adjustments are present, more 
can be lost during the rest interval. However, 
with large amounts of prerest practice, the set 
responses should develop more habit strength, 
and therefore be more resistant to forgetting. 
At first blush, our results seem contrary to the 
concept of habit strength. However, although 
late in learning the reduction constant of the 
forgetting curve is less than it is early in learning, 
the absolute amount of set lost late in learning is 
greater because there is more present that can 
be lost. In other words, although the percentage 
of set lost late in learning is less than that lost 
early in learning, the amount of adjustments lost 
is greater because there is more present that can 
be lost. 

An alternative explanation would be that the 
acquisition of set involves not only the learning 
of new responses, but also the elimination of 
reactions, i.e., the dropping out of superfluous 
responses. Our results suggest that even if this 
latter hypothesis is correct, the number of correct 
responses acquired must be greater than the 
number of incorrect responses lost. If the reverse 
were true, less warming-up activity would have 
been necessary as the amount of prerest practice 
increased. 

The importance of the set responses is also 
revealed in the groups that had no warming-up 
activity. Figure 1 indicates that retention for 
those groups did not vary with different amounts 
of prerest practice. To explain these results it is 
advantageous to assume that Jz must have in- 
creased with learning. The dissipation of this 
greater inhibition offset the greater loss in set 
that occurred when rest was introduced late in 
learning. That more set reactions were lost late 
in learning is strongly suggested by the remainder 
of the curve depicted in Fig. 1. 

Our finding that reminiscence following zero 
warm-up does not vary with increasing practice, 
is not incompatible with the results obtained by 
Wasserman (11). Using the alphabet printing 
task and two degrees of motivation, he found a 
decrease in reminiscence late in learning with 
low motivation but not with high motivation. 
It is possible that the motivation in the present 
experiment was much closer to Wasserman’s 
high-motivation groups. 


Retention following massed and dis- 
tributed warming-up activity.—In Fig. 
3 are shown the mean increases in 
score for the groups having either 
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Fic. 3. The mean increase in score on the trial 
following various amounts of massed and dis- 
tributed warming-up activity after 15 trials of 
prerest practice 


massed or distributed warming-up 
activity. Results of the relevant 
groups of Part I are included in the 
curve for the massed warming-up 
activity. It is evident that the curve 
reaches its peak following 1 min. of 
massed warming-up activity, decreases 
beyond 1} min. of warming-up activ- 
ity, and then levels off following 23 
min. of warming-up activity. Appar- 
ently, warming-up activity in excess 
of a certain amount resulted in de- 
creased performance. The fact that 
progressively increasing amounts of 
warming-up activity beyond a critical 
level does not result in progressively 
inferior retention scores needs expla- 
nation. 

Apparently, after a certain amount 
of warming-up activity most of the 
perceptual-motor adjustments are re- 
instated. After this critical level has 
been reached, the effects of further 
warming-up activity are offset by the 
accumulation of Jz. Our results sug- 
gest that after 24 min. of work on the 
intervening task, the ratio between 
the two variables (Jz and reinstate- 
ment of set) becomes constant. 

Figure 3 also suggests that after 
a certain amount of warming-up 
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activity, performance following rest 
decreases. (This trend is shown 
statistically in Table 2.) Apparently, 
immediately after the optimum point 
has been reached, the detrimental 
effects of Jz are greater than the bene- 
fits resulting from reinstatement of 
set. Hence, before the curve levels 
off, the amount of Jr accumulated 
and the amount of set reinstated is 
weighted in favor of Jp. 

Reference to Fig. 3 indicates that 
the performance of the groups having 
distributed warming-up activity is 
superior to the performance of those 


TABLE 3 


ANALYsIs OF VARIANCE OF THE MEAN INCREASE 
In ScoRE ON THE TRIAL FoLLowinc 
Various Amounts oF MAssED AND 
DistRIBUTED WARMING-UP 
Activity AFTER 15 PREREST 











TRIALS 
Source of Variance ae af F 
Distribution 672.13 1 48.39 * 
Warming-up activity | 66.45 2 2.39 
Interaction 33.22 2 1.20 
Error 1583 114 
Total 2354.80} 119 














* Significant at the .001 level of confidence. 


groups having massed warming-up 
activity. An analysis of variance 
showed the F ratio for distribution of 
practice on the warming-up activity 
to be significant at the .001 level 
(Table 3). These results are analyzed 
further in Table 4 which shows the 
significance of the difference between 
the mean increase in score following 
massed and distributed warming-up 
activity. It is evident that for 3, 4, 
and 5 min. of practice on the inter- 
vening task, distribution yielded a 
significantly greater postrest perform- 
ance. 


TABLE 4 


Mean Increase 1n Score AFTER Rest 
Fottow1nc Massep ANp DistTrIBuTED 
Warminc-up Activity AFTER 15 
Triats oF PREREST 














PRACTICE 

Amount of Massed Spaced 

Warming- t* 

up Activity 

(Min.) Mean | SD | Mean| SD 

3 3.75 | 3.97 | 7.90 | 3.18 | 3.52*** 
4 3.10 | 4.29 | 6.95 | 2.21 | 3.26** 
5 3.75 | 3.28 | 9.95 | 4.38 | 5.26*** 




















*? values are based upon the error variance of the 
analysis of variance in Table 3. 
** Significant at the .01 level of confidence. 
** Significant at the .001 level of confidence. 


It was indicated earlier that dis- 
tributed practice on the warming-up 
activity should prevent the accumula- 
tion of J» for this set-maintaining activ- 
ity. The reactive inhibition which is 
built up with each trial fades during 
the rest interval. Hence, perform- 
ance on the main task cannot be 
depressed by /p. 

Figure 3 also reveals that increasing 
amounts of distributed practice on the 
intervening task tend to produce pro- 
gressively greater increments in per- 
formance without any observable 
leveling off. Possibly all the percep- 
tual-motor adjustments lost during rest 
had not been completely reinstated. 
Even if this explanation is correct, the 
marked rise in performance which is 
found with increasing amounts of dis- 
tributed warming-up activity would 
not be expected. 

However, another factor may have 
been operating. Ammons (1) sug- 
gests that the warming-up effect de- 
creases as the number of rest intervals 
increases. In the groups having dis- 
tributed warming-up activity the Ss 
had had practice in shifting from rest 
to work. This may explain the in- 
crease in retention which occurred with 
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increasing amount of distributed 
warming-up activity. 

In general, it is evident that the 
results of this study have substanti- 
ated the major hypothesis and the 
related predictions, and thus have lent 
support to the theoretical basis of the 
experiment. 


SUMMARY 


Twenty-four groups of 20 Ss were 
given the alphabet printing task in 
order to test the following hypothesis: 
Warming-up activity aids retention as 
long as the benefits accruing from the 
reinstatements of set are greater than 
the detrimental effects of the accum- 
ulation of additional Jp. 

A 10-min. rest pause was introduced 
after 5, 15, and 30 trials of prerest 
practice. In each of these conditions 
Ss were given 0, $, 1, 14, 2, and 24 min. 
of warming-up activity, consisting of 
making tallies and crossbars. Six 
other groups having 15 trials of prerest 
practice received 3, 4, and 5 min. of 
either massed or distributed warming- 
up activity. The Ss in all the groups 
received 36 trials of 30 sec. each. The 
results and conclusions may be sum- 
marized as follows: 


1. The Ss who received warming-up 
activity were significantly superior in 
performance on the trial following rest 
to those who had rest only, with no 
warming-up activity. 

2. The amount of warming-up activ- 
ity necessary to reinstate set was found 
to be an increasing function of the 
amount of prerest practice. This was 
interpreted as indicating that the num- 
ber of perceptual-motor adjustments 
acquired during learning increased 
with increasing amounts of prerest 
practice. 

3. With increasing amounts of 
massed warming-up activity, there was 
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a tendency for retention to increase up 
to a point and then decline. This 
decrease indicates that the Jz built up 
during the practice on the warming-up 
activity summated with the /z accum- 
ulated during the practice on the main 
learning task. 

4. Increasing amounts of massed 
warming-up activity beyond a certain 
point did not result in a further de- 
crease in performance. It was sug- 
gested that beyond a certain amount 
of warming-up activity, the ratio 
between the amount of set reinstated 
and the amount of Jz accumulated 
becomes constant. 

5. Because the accumulation of ad- 
ditional Jz was prevented, distributed 
warming-up activity resulted in greater 
increments in performance than did 
massed warming-up activity. 

6. Increasing amounts of distributed 
warming-up activity resulted in pro- 
gressively greater increments in per- 
formance. The possibility was consid- 
ered that the adoption of a particular 
set may have been facilitated by the 
constant shifting from working to 
resting with the distributed warming- 
up activity. 


(Manuscript received November 29, 
1951) 
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DECREMENTS AND RECOVERY FROM DECREMENTS IN A 
SIMPLE WORK TASK WITH VARIATION IN 
FORCE REQUIREMENTS AT DIFFERENT 

STAGES OF PRACTICE! 


EDWARD A. BILODEAU 
Perceptual and Motor Skills Research Laboratory, Human Resources Research Center 


Almost all of the recent research in 
motor behavior has dealt with motor 
tasks in which S gradually acquires 
considerable proficiency. In the the- 
oretical analysis of the data two dif- 
ferent types of intervening variables 
have been postulated: one for the 
learning factor (sHR) and one for the 
work factor (Jr). Typically, Jz and 
sHp vary together and are studied in 
controlled situations where the effects 
of one can usually be isolated from 
those of the other. It is equally desir- 
able, however, to study one of these 
factors in situations where the other 
remains fairly constant and equal be- 
tween conditions throughout practice. 

It is also true that in the typical 
motor performance experiment the nth 
response is often unlike the first. This 
appears to be a serious shortcoming 
whenever J, is related to duration of 
the practice session or number of pre- 
vious responses. 

A device has been constructed for 
this study which presumably allows 
for equal sH p’s between conditions and 
fairly constant amounts of sHr 
throughout practice. This has been 


1 The experimental work for this study was 
performed as part of the United States Air Force 
Human Resources Research and Development 
Program and is described in (2). The opinions 
or conclusions contained in this report are those 
of the author. They are not to be construed 
as necessarily reflecting the views or endorsement 
of the Department of the Air Force. 

The author wishes to express his appreciation 
to 2nd Lt. Thomas G. Ferguson for constructing 
and calibrating the apparatus, and to Mr. Robert 
E. Morin, who supervised the statistical analysis 
of the data. 


accomplished by demanding a rela- 
tively simple response which can be 
assumed to be fairly well established 
in the response hierarchy. Further, 
decrements in performance have been 
induced through manipulations of the 
amount of force required to accom- 
plish a simple motion rather than the 
much more customary procedure of 
varying the duration of the practice 
period (1). 

Two different forces were required 
to turn a manual crank. One group 
practiced with one of these forces while 
a second group practiced with a second 
force. The third and fourth groups 
used the same forces as the controls 
but after each minute of practice the 
force required for handle-turning was 
changed. Thetwoexperimental groups 
differed from each other in that they 
never practiced with the same force 
during the same minute. In other 
words, they were out of phase. It was 
thus hoped that by systematic varia- 
tion in the force required to move a 
resistant object the effects of force and 
locus of change in force could be 
isolated. 

MeETHOD 


Subjects.—A total of 160 basic airmen trainees 
at Lackland Air Force Base was divided equally, 
but unsystematically, into four groups. No S 
had received prior practice with the apparatus. 

A pparatus.—The basic apparatus consisted of 
a manual crank and a system allowing for con- 
trolled variation in the force necessary to turn 
the handle at any given rate of revolution? The 


2 A series of cranking experiments has been 
reported by Reed (4) in which such variables as 
radius of crank handle, braking force, orientation 
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TABLE 1 


HorsEPoweER REQUIREMENTS FOR Loaps 0 AND 
4 aT SEVERAL REPRESENTATIVE RATES OF 
Rotation oF CrRANKING DEVICE 




















Rate of Rotation (Rev./20 Sec.) 
Load 
40 | 50 | 60 70 | 80 
0 .0095 | .0135 | .0180 | .0225 | .0280 
4 .0285 | .0410 | .0545 | .0690 | .0850 











crank handle, with a turning diameter of 9 in., 
was mounted in the horizontal plane at the edge 
of a table. Upon the completion of each revolu- 
tion of the handle, a counter recorded a score of 
one. The force required to turn the handle could 
be instantaneously varied by five steps, i.e., 
Loads 0, 1, 2, 3, and 4. The two extreme loads 
(0 and 4) were used in the present study. Horse- 
power (HP) values over a wide range of rate of 
revolution were calculated for Loads 0 and 4 by 
means of standard engineering calibration pro- 
cedures. The HP requirements at representa- 
tive rates of revolution (number of revolutions 
per 20 sec.) are presented in Table 1. This table 
makes evident the following properties of the 
crank system: (a) HP increases with increasing 
rate of rotation, (b) the difference in HP between 
the two loads increases with increasing rate of 
rotation, and (c) HP ratios for the two loads 
remain constant over all rates of rotation. For 
example, the ratio of Load 4 to Load 0 equals 
3.06 regardless of the rate of rotation. It is also 
true that the ratio of the force for Load 4 to that 
for Load 0 equals 3.06 for any given rate of 
rotation. 

Procedure.—The Ss of all four groups, stand- 
ing before the handle and using a standard grip, 
rotated the handle as fast as possible for an 
initial continuous practice period of 5 min. (ini- 
tial practice). After a rest period of 40 sec., 
they practiced for an additional continuous prac- 
tice period of 3 min. (final practice). Group 0 
practiced with the minimum required force 
(Load 0); Group 4 practiced with the maximum 
force available (Load 4). The third and fourth 
groups, (Groups 0-4 and 4-0), alternated between 
these two loads after each minute of initial prac- 
tice, Group 0-4 beginning with Load 0, and 
Group 4-0 beginning with Load 4. All four 
groups practiced throughout final practice under 
conditions identical with those obtaining during 
the last minute of initial practice, that is, Groups 


of the crank, height of the crank as well as hand- 
edness, direction of rotation, bodily position, and 
fatigue are discussed. 


0 and 0-4 practiced with Load 0, and Groups 4 
and 4-0 practiced with Load 4. 

No S was told about the durations of the 
practice periods nor instructed about the crank 
loads. The number of crank revolutions per 
20-sec. periods was recorded. 


RESULTS 


Analysis of scores in terms of rate of 
rotation. The mean number of revo- 
lutions per 20 sec. is plotted against 
successive 20-sec. periods for both ini- 
tial and final practice in Fig. 1. The 
upper curve is for Load 0, the bottom 
curve is for Load 4. Both curves 
appear to have the same origin, afford- 
ing some basis for the contention 
that initially there were equal habit 
strengths for both groups. With prac- 
tice, however, there are marked decre- 
ments for both, the performance with 
the heavier load falling off more rapidly 
for the first minute. After the first 
minute there appear constant differ- 
ences in output. The indication is 
that each is approaching a different 
minimum. 

During the first minute, Groups 0 
and 0-4 have worked with Load 0, 
Groups 4 and 4-0 have worked with 
Load 4. The mean difference between 
loads, evaluated by means of the 
within-groups error estimate, is signifi- 
cant beyond the 1% level (¢ = 3.749, 
with 156 df). When Group #0 is in- 
stantaneously shifted from Load 4 to 
Load 0 after 1 min. of practice there is 
a marked increment in scores but not 
quite to the level of the Load 0 control 
(difference significant at the 6% level 
with t = 1.927 and 156 df). When 
Group 0-4 is shifted from Load 0 to 
Load 4 there is a pronounced decre- 
ment in output to a level somewhat 
below the Load 4 control, but this new 
level in rate of responding does not 
differ significantly from the Load 4 
control (¢ = 1.648, with 156 df). After 


each successive shift each group nearly 
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Fic. 1. Mean number of revolutions per 20 sec. is plotted against successive 20-sec. scoring periods 


approximates its control. Further, 
the mean number of crank turns dur- 
ing the entire 5-min. period for Group 
0-4 almost equals that for Group 40, 
and each is nearly equal to the average 
output of Groups 0 and 4. The mean 
totals for Groups 0, 4, 0-4, and 4-0 
are 62.86, 54.88, 58.06, and 58.04, 
respectively. 

After a rest period of 40 sec. each 
group shows considerable recovery, 
but none recovers to the initial level 
of responding. Numerically, each of 
the four groups recovered nearly iden- 
tical amounts, the difference being far 
from significant by the F test (F = .38, 
with 3 and 156 df). 

The difference between Groups 0 


and 4 after rest also remains fairly 
constant and is approximately equal 
to that of prerest practice. Group 4-0, 
practicing throughout final practice 
with Load 4, is remarkably similar to 
Group 4 in mean scores whereas Group 
0-4, working with Load 0, is much like 
Group 0. Evidently, the alternation 
between forces during initial practice 
had little differential effect upon per- 
formance during final practice. 
Analysis of scores in terms of horse- 
power.—When number of revolutions 
per 20-sec. periods is converted to an 
HP scale, output during both practice 
periods shows a direct relationship to 
load rather than the inverse relation- 
ship evident in terms of number of 
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revolutions. Further, a measure of 
HP recovery during rest was calcu- 
lated by converting the last score of 
initial practice and the first score of 
final practice to HP and obtaining the 
difference. An average of .0075 and 
.0080 HP was recovered by Groups 0 
and 0-4, while .0220 HP was recovered 
by both Groups 4 and 4-0. In other 
words, the two groups with the slower 
rate of rotation during the terminal 
period of initial practice recovered the 
most in terms of HP.* 

Analysis of decrement for Groups 0 
and 4.—An extensive analysis of the 
decrements for the individuals of the 
upper and lower quartiles has been 
published elsewhere (2). This anal- 
ysis tended to show that those groups 
beginning a practice period at a level 
of responding greater than that for 
other groups exhibited greater decre- 
ments. The results were interpreted 
as indicating that the magnitude of 
response decrement at any time during 
continuous practice is some increasing 
function of the magnitude of the initial 
level of responding, (sEz). 


Discussion 


After the first minute of practice the 
difference between the response rates 
of the two nonalternating control 
groups was nearly constant; with 
further practice they showed parallel 
decrement; with rest both groups spon- 
taneously recovered equal amounts. 
The two alternating groups showed a 
marked tendency to approximate the 
rate of their respective controls when- 
ever the same load was employed. 
This was true with instantaneous 
shifts in force treatment before rest 
and true also after rest when the 
groups might have been expected to 


* A second experiment now in preparation will 
deal with spontaneous recovery in greater detail. 


differ because of the prior alternation 
treatment. 

Thus, the data of both initial and 
final practice have indicated that rate 
of responding is a function of the pre- 
vailing load and of the duration of 
previous practice. The load obtain- 
ing during prior practice does not ap- 
pear to be particularly effective in 
producing differential response rates 
in subsequent practice. This suggests 
that the output of the responding 
member represents an adjustment to 
the duration of previous practice and 
to the load prevailing in such a manner 
that the former load is almost irrele- 
vant in determining present rate. 

Each of the above findings is con- 
sistent with the hypothesis that the 
Ss of each group are actually about 
equally fatigued. Such a conclusion 
is surprising to those who, perhaps, 
intuit that work against a heavy load 
is more fatiguing than work against a 
light load. This interpretation fails 
to recognize that S is free to choose his 
own pace. If it is granted that the 
present groups were about equally 
fatigued in spite of working against 
different loads and different time dis- 
tributions of these loads, then it must 
be granted that Ss choose their own 
pace according to the load condition. 
The hypothesis here is that there is a 
similar state of tolerance for fatigue 
from group to group. Given a single 
state of tolerance for fatigue, differ- 
ential rates are established as a func- 
tion of duration of previous practice 
and present load. Differences in rate 
attributable to load are such that load 
is not relevant for predicting subse- 
quent performance level. In order to 
maintain similar levels of fatigue from 
group to group there must be a differ- 
ence in rate of responding between 
groups working against different loads. 
In simplest terms, it is as if any S 
had only so much to give and that 
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rate (pacing) represents an adjust- 
ment to this state of affairs. 

It is interesting to contrast the fore- 
going interpretation of the results with 
the theoretical assumption underlying 
the choice of a cranking task in the 
first place. The task was originally 
designed so as to provide a situation 
eliciting equal sHz from load to load 
in order that the learning factor would 
not be confounded with the effects of 
Ip. Thus, it is reasonable to venture 
that under the conditions of the pres- 
ent study, the habit, drive, and decre- 
ment terms of the equation for deter- 
mining momentary effective behavior 
potential are equal from load to load. 

It will be recalled that HP for the 
heavy load group was greater than for 
the light load group. This is, of course, 
a statement which is relevant to work 
which is done against the external 
environment. In terms of energy ex- 
penditure, however, conversion of rev- 
olutions to HP neglects the weight of 
S’s arm. If the weight of the faster 
moving arm of the light load groups 
were taken into account, it seems rea- 
sonable to suppose that the HP ex- 
pended by this group would be similar 
to that expended by the heavy load 
group. 

The interpretation of the present 
data has in effect suggested that the 
phenomena of massing-spacing of prac- 
tice in motor learning studies be re- 
examined in regards to possible pacing 
effects. It could be, for example, that 
at least some of the difference in rate 
of responding results not so much from 
differential fatigue as from differential 
pacing appropriate to the distribution 
stimuli. 


SUMMARY AND CONCLUSIONS 


The Ss (N = 160) practiced rotat- 
ing a crank handle as fast as possible 
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for 5 min. with either a light or heavy 
loading of the crank. One group 
practiced with the light load, a second 
practiced with the heavy, and the third 
and fourth groups alternated between 
loads after each minute of practice but 
were out of phase. After a brief rest 
all groups practiced for 3 min. with 
the loading used just before the rest. 

During the first minute of practice, 
rate of responding dropped more rap- 
idly with the heavy load than with the 
light load. Thereafter, differences 
between the loads remained fairly con- 
stant. After a brief rest both load 
groups recovered equally in terms of 
rate, but the heavy load group recov- 
ered more in terms of HP. 

When the two groups which had 
practiced under one load condition 
were instantaneously shifted to the 
other, rate output was approximately 
equal to that of the nonshifting 
groups. Furthermore, after rest both 
alternating groups spontaneously re- 
covered to the level of their non- 
alternating controls as if unaffected 
by the alternation treatment of initial 
practice. 


(Manuscript received December 14, 
1951) 
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LEVEL OF REPETITION IN THE “SPREAD OF EFFECT”! 


FRED D. SHEFFIELD AND WILLIAM O. JENKINS 


Yale University 


In an earlier study by the writers (3) 
it was shown that guessing habits are 
a major factor in accounting for the 
“spread of effect.” The original study 
has been followed by a number of in- 
vestigations (1, 2, 3, 4, 7, 8, 9), most 
of which have provided ample confir- 
mation of the guessing-sequence inter- 
pretation. It appears that Thorndike’s 
“spread-of-effect” phenomenon, which 
he considered a proof of the automatic 
action of reward, is an artifact due 
chiefly to S’s tendency to guess num- 
bers systematically rather than ran- 
domly. The operation of this artifact 
is very clearly seen in a study (7) that 
demonstrated a spread gradient under 
circumstances in which numbers were 
guessed without either reward or a 
learning task. In this study, however, 
while a clear-cut gradient was found, 
the over-all level of repetition was con- 
siderably lower than the levels typically 
obtained in Thorndike’s investigations 
involving a comparable number of 
choices (10). 

The gradients obtained without re- 
ward or learning (7) are compared in 
Table 1 with the average of three 
appropriately selected experiments by 
Thorndike (10, Exp. F, H, and I). 

Thus, in the position immediately 
after reward, Thorndike obtained 
around 20% repetition when using 
numbers from 1 to 10. But in the 
experiment without reward or learn- 
ing, a value of only 13.3% was found 
in this position. The purpose of the 


1 This research was facilitated by the Labora- 
tory of Social Relations, Harvard University, and 
Yale College. The writers are grateful to Dr. F. 
Mosteller for valuable comments on the manu- 
script and to Mr. James Stephens who ran the 
individual Ss as a Bursary assistant financed by 
Yale College. 


Harvard University 


present experiments was to investigate 
some of the factors that might account 
for this discrepancy. A more general 
purpose was to provide information on 
the relative importance of the various 
factors that influence the level of 
repetition in the “spread-of-effect”’ 
phenomenon. 

A factor thought to be of impor- 
tance in accounting for the difference 
in levels of repetition in the Thorndike 
situation and in that without learning 
was the presumed distracting effects 
of the learning task. In the study of 
spread without reward or learning (7), 
the Ss thought they were participating 
in an ESP experiment and that the 
task was to reproduce, through “clair- 
voyance,” designated portions of a 
printed table of random numbers. 
Specific instruction as to the nature of 
randomness was given. There was no 
learning task and S could concentrate 
on guessing numbers at random. The 
S wrote his numbers on a prepared 
form and could see his prior guesses. 
In the typical Thorndike experiment, 
on the other hand, S’s task is to learn 
word-number pairs—a situation that 
could distract him from carrying out 
instructions to guess numbers “‘at ran- 
dom.” Thorndike’s S’s usually guessed 
aloud and could not keep track of prior 
guesses visually. 


Taste | 


ComPARISON OF EXPERIMENTS 








Position in after Gradient 
Experiments 








Without reward or 


learning (7) 13.3 | 11.4} 11.2 | 10. 
Thorndike (10) 20.4 | 19.3 | 14.5 | 17. 
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Thus, if S has systematic tendencies 
in number guessing, these might be 
maximized in the Thorndike situation 
with attention directed to a learning 
task, and minimized with attention 
directed solely to reproducing a table 
of random numbers. This hypothesis 
had an a priori reasonableness, and 
previous findings (3) had indicated 
that the level of repetition was higher 
under conditions of distraction. To 
test this hypothesis, guessing sequences 
were obtained under two conditions: 
(a) no distraction, and (6) with the 
distraction of an irrelevant learning 
task going on at the same time num- 
bers were guessed. An attempt was 
made to duplicate the distracting 
aspect of Thorndike’s task under con- 
ditions which precluded the action of 
reward as a source of “spread of 
effect,” but allowed the operation of 
the guessing-sequence factor. In addi- 
tion to this major comparison, evi- 
dence was collected on the effect of 
other factors which could raise the 
level of repetition in the Thorndike 
situation. 

MeEtTHopD 


Two independent sets of experiments were 
carried out, one using an individual testing pro- 
cedure at Yale, the other employing group testing 
at Harvard. The Ss were undergraduate stu- 
dents. The basic design consisted of comparing 
the performance under experimental conditions, 
in which Ss guessed numbers while simultane- 
ously learning verbal material, with the perform- 
ance under control conditions, where numbers 
were guessed without the distraction of any 
learning task. 

The distracting task under the experimental 
conditions was paired-associates learning of three- 
letter meaningful words. These were randomly 
selected and paired from an available exhaustive 
list (6). In all experiments, Ss were instructed 
to guess a number from 1 to 10 at random as 
soon as the stimulus term of a pair was exposed. 
In some experiments the anticipation procedure 
was used; in others, tests with only stimulus 
terms followed presentation of the pairs. In all 
cases, both in acquisition and testing, S was 
required to guess his number immediately after 
the presentation of each stimulus term. With 


the individual procedure S called out his number 
and E recorded it; with the group procedure S 
wrote his number on an answer sheet and imme- 
diately covered it with his hand. With one 
exception to be noted later, the exposure rate 
used throughout all experiments was 4 sec. 

Under the experimental conditions, Ss were 
told that the purpose was to test the effect of 
distraction on learning. The requirement of 
guessing a number was explained to S as an 
attempt to provide a distraction which would 
interfere with the learning process. Under the 
control conditions, S was asked to guess numbers 
from 1 to 10 in response to a 4-sec. signal, with 
no rationale offered other than an appeal to his 
faith in the scientific importance of the study. 

With the individual testing procedure, a modi- 
fication of the Hull exposure apparatus was used. 
In the main experiment ten paired associates 
were employed, and these were varied in six 
random orders of presentation, using the standard 
anticipation procedure. The roll of six random 
changes was run through twice, yielding a total 
of 120 number guesses per S. The same Ss 
(N = 22) served under both the experimental 
and control conditions. Experimental condi- 
tions were always run first. The decision not to 
randomize the order of the experimental and 
control conditions was based partly on the 
assumption that a control was provided by the 
previous study referrred to (7). A further im- 
portant consideration was the possibility that 
running Ss first under the control conditions 
might lower their motivation when they subse- 
quently learned under “distraction.” Two aux- 
illiary groups (V = 15 and N = 9) were run 
with nonanticipation techniques to be described 
later. 

In the group-testing procedure, the experi- 
mental condition involved 60 paired associates 
printed on flash cards. Both members of a 
given pair were exhibited simultaneously in the 
training trial. The nonanticipation method of 
testing was used; the test trials consisted of pre- 
senting only the stimulus terms in a different 
random order from that employed in the training 
trial. Thirty-two Ss were run under these con- 
ditions; 52 additional Ss were run under the 
control conditions of number guessing only. 
Under both experimental and control conditions, 
120 number guesses per S were obtained. 

An additional 11 Ss were run with the group 
procedure using the same order of stimulus-term 
presentation on the test as had been used on the 
training trial. This group provided evidence on 
the effect of the fixed-list factor, present in all 
of Thorndike’s studies and a possible source of 
greater repetition through the action of word- 
number associations. 

The scoring procedure in all experiments was 
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similar to that described previously (7). The S’s 
total series of guessed numbers were divided into 
sequential subsets (to be described later in con- 
nection with each experiment). Successive sub- 
sets were aligned in such a way that S’s guessed 
numbers in the same ordinal position within 
subsets could be compared. Each repetition of 
the same guess in the same ordinal position in 
two successive subsets were scored as a repeat, 
and the presence or absence of other repeats in 
the five following positions was determined. 
This scoring method parallels that of Thorndike 
(10); the only important difference is that 
“rewarded” responses in Thorndike’s procedure 
are replaced by chance repetitions in the present 
experiments. 

In applying significance tests the percentage 
repetition score of each S (at a given point on the 
gradient) was treated as a single observation. 
Thus WN in each comparison is the number of Ss 
involved rather than the total number of repeti- 
tions. Except where the contrary is noted, the 
1% level of significance was accepted. 


Resutts oF Main ExperIMENTS 


Individual testing results —In the 
main experiment with individual test- 
ing, each of the 12 runs through the 
list of ten pairs was considered a trial, 
(i.e., a “‘subset’’), and each such trial 
was compared with the succeeding one 
to obtain the “after” gradient. The 
data obtained are shown in Table 2. 

It can be seen that the expectation 
of a higher level of repetition with a 
distracting learning task is not borne 
out. The difference is for the most 


TABLE 2 


GraDIENT OF REPETITION FoLLow1nc CHANCE 
Repeats: InptvipvAL TESTING WITH THE 
ANTICIPATION PROCEDURE AND Eacu S 
Servinc as His Own Controi 
(22 Ss) 








Position 
after Chance Repeat 
Condition 








Mean per cent repeti- 
tion with learning 
task 17.2) 14.7} 11.7] 11.2] 14.7 

Mean per cent repeti- 


tion without task | 19.2) 16.9) 15.6) 11.7} 12.7 




















TABLE 3 


GrapviEent or Repetition Fottowrnc CHANCE 
Repeats: Group TESTING WITH THE 
NONANTICIPATION PROCEDURE 








Position 
after Chance Repeat 
Condition 








Mean per cent repeti- 
tion with learning 
task (32 Ss) 16.9} 13.7} 13.1] 13.1] 10.0 

Mean per cent repeti- 
tion without task 
(52 Ss) 15.4) 14.4) 10.9) 11.9) 7.5 




















part in the opposite direction, although 
not significantly so at any point on 
the gradient. The p value for the 
largest difference in Table 2 (Position 
3) was .21. However, both gradients 
are significantly higher in level than 
the one obtained in the previous study 
(7). Thus, whereas the previous result 
obtained a level of only about 13% 
repetition in the first position, the pres- 
ent finding gives levels of about 17% 
and 19%, respectively, for the experi- 
mental and control conditions. 

It would appear that omitting in- 
structions about the nature of random 
numbers and eliminating the chance 
to keep track of guesses visually, raised 
the level of repetition to values closer 
to Thorndike’s, but that the addition 
of a distracting task had no further 
effect. 

Group-testing results —In the main 
group experiments Ss wrote their num- 
bers on forms having four columns of 
30 spaces each. Gradients were ob- 
tained by comparing each column with 
the succeeding one, as previously 
described. The results are shown in 
Table 3. 

As in the individual test results, the 
initial hypothesis is not borne out to a 
statistically significant degree. What 
differences were obtained are in the 
direction expected from the task-dis- 
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traction hypothesis, but the differences 
are not significant either over-all or at 
individual points. The smallest p 
value for any of the differences between 
experimental and control conditions 
shown in Table 3 is .20. As in the 
individual testing results, however, 
the levels of repetition are appreci- 
ably higher in positions immediately 
after chance repeats than in the earlier 
group study (7). 


ReEsutts oF AMPLIFYING 
EXPERIMENTS 


Anticipation versus nonanticipation 
in individual testing. —From the guess- 
ing-sequence hypothesis it is expected 
that if the task is too distracting, 
guessing sequences will disappear. 
Thus the hypothesis depends on each 
guess being a stimulus which influences 
the next guess as a response. Ifa very 
distracting task intervenes between 
each guess, it might render the pre- 
ceding guess an inconsequential as- 
pect of the stimulus pattern at the 
time the succeeding guess is made. 

It was thought that the anticipation 
method used in individual tests might 
involve a greater degree of distraction 
than Thorndike’s procedure. Thus 
it might be expected that a nonantici- 
pation procedure would yield levels of 
repetition more closely approaching 
Thorndike’s. Totest this hypothesis, 
15 Ss were run with a nonanticipation 
individual procedure comparable to 
that used in group testing. 

The mean per cent repetition from 
positions one through five was: 15.8, 
18.0, 12.2, 12.7, and 9.3. By com- 
paring these figures with the top line 
of Table 2 it can be seen that the 
hypothesis is not supported; the non- 
anticipation method did not generally 
increase the level of repetition over 
that obtained with the anticipation 
procedure. 

Acquisition versus recall in individual 


testing.—With the anticipation proce- 
dure, S’s task is a mixture of trying to 
learn each pair that has not yet been 
mastered and to recall the correct 
response term before it is presented. 
In the Thorndike experiment, how- 
ever, S does not have an opportunity 
to rehearse each pair—at least in the 
early part of learning where S has to 
find out what the pairs are by guessing. 
For the most part, as each stimulus is 
presented, S is attempting to recall 
any response which may have been 
called “right” on a previous trial or is 
attempting to recall what responses 
have been called “wrong” on previous 
trials. Thus it can be argued that the 
Thorndike situation is one in which 
the chief distraction is an attempt to 
recall rather than to rehearse. 

In an attempt to separate the fac- 
tors of rehearsal and recall that distrac- 
tors which contribute to spread, a 
modified nonanticipation procedure 
was used with nine Ss. The same 
type of list (i.e., ten pairs and six ran- 
dom changes) was used that had been 
used in the main individual experi- 
ment. However, instead of attempted 
anticipation when each stimulus term 
was shown, S alternately concentrated 
on memorizing the pairs (both terms 
shown simultaneously) without overt 
anticipation on odd-numbered trials 
and attempting recall—to stimulus 
terms only—on even-numbered trials. 
As in the main experiment the list was 
presented 12 times, giving six training 
trials with pairs presented alternated 
with six tests in which only the stimuli 
were presented. 

Each S learned two different lists 
under the above conditions. The two 
lists were learned under different pre- 
sentation rates, one with a 4-sec. cycle 
and the other with a 2-sec. cycle. 
Analysis consisted of comparing suc- 
cessive training trials and, separately, 
successive tests, with the same pro- 
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TABLE 4 


“Spreap” Arounp CHANCE RepEATs WHILE 
Memorizinc (TRIALS) AND ATTEMPTING 
Recatt (Tests) or AN IRRELEVANT 
LearninGc Task wit Eacu S 
Servinc as His Own 
Controt (9 Ss) 








Position after Chance Repeat 


M 














1 2 3 4 5 
Trials 20.1 | 14.9 | 12.0 | 10.4 | 13.3 
Tests 30.6 | 21.9 | 18.9 | 20.0 | 17.6 

















cedure as used in the main individual 
experiment. The results are sum- 
marized in Table 4. No consistent 
effect of the presentation rate was 
found, so the results are broken down 
only by “trials” versus “tests,” an 
analysis which presumably pits the 
distracting effects of rehearsal against 
those of attempted recall, respectively. 

It should be noted that the results 
are fairly unstable with only nine Ss 
even though each S learned two lists. 
However, while the “trial” results are 
roughly comparable with the findings 
in the main individual experiment, the 
“test” results show a significantly 
higher level of repetition at all points 
on the gradients. The results indi- 
cate that guessing sequences are more 
likely to operate when S’s symbolic 
activity is occupied in attempting ver- 
bal recall of the correct response than 
when both members of a pair are 
shown and S is silently attempting to 
memorize these. 

Use of a fixed list in group testing.— 
A feature of Thorndike’s experiments 
(10) which could increase the over-all 
level of the after gradient was his use 
of fixed orders of stimuli in successive 
presentations of lists. In all the pres- 
ent experiments described thus far 
mixed lists were used. That is, the 
order of the stimuli was varied on 
successive acquisitions and test trials 
in individual testing and was uncor- 
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related in training and test in the 
group testing. This procedure pre- 
cludes the possibility that initial word- 
number associations might raise the 
level of repetition in the after-gradient. 
Thus, for example, if S has an initial 
tendency always to guess “two” when 
the stimulus is “leg” he will tend to 
repeat himself in that same position if 
the list is fixed. To the extent that 
such initial associations operate, the 
entire level of repetition of “wrongs” 
is raised. Moreover, if an association 
is formed during the experiment be- 
tween stimulus terms and the wrong 
response number guessed, any ten- 
dency for the association to persist 
will show up as an increased repetition 
in the same position of a fixed list. 

A check on the effect of fixed order 
was provided by an auxiliary group 
experiment (N = 11) in which a fixed 
order of stimuli was used in both train- 
ing and testing. Otherwise the pro- 
cedure was exactly the same as the 
main group experiment (with learning 
task), and the same list of pairs was 
used. 

The mean per cent repetition from 
Position 1 through 5 was: 20.7, 18.8, 
14.3, 13.2, and 7.4. By comparison 
of these results with the results for the 
task groups in Table 3, it is apparent 
that the level of repetition is higher 
with the fixed than with the mixed 
list. The differences at the first and 
second points are significant at the 2% 
level of confidence. The findings may 
be interpreted as indicating that an 
important factor explaining Thorn- 
dike’s level of repetition in his typical- 
spread studies was his consistent use 
of fixed-stimulus orders. 


Discussion 


The main import of the present 
results is that guessing sequences seem 
sufficient to account for Thorndike’s 
spread phenomenon. In the earlier 
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study (7), the nature of random num- 
bers had been carefully explained and 
Ss had an opportunity to keep track 
of their sequences visually. Even 
under these conditions a spread gradi- 
ent was found, but with a low level of 
repetition. In the present studies the 
most comparable procedure without 
either instructions on randomness or 
opportunity to keep track of sequences 
is the control condition in the main 
group experiment. As can be seen in 
Table 3 the level of repetition was 
raised by eliminating these factors. 
Also, when Thorndike’s fixed-list fac- 
tor was included, the level of repetition 
was further raised. Moreover, when 
the task was made—as in Thorndike’s 
situation—primarily a matter of recall 
rather than acquisition, an appreci- 
able increase in repetion was obtained 
(Table 4). It would appear, there- 
fore, that the more closely the factors 
operating in the Thorndike situation 
are approximated, the more nearly 
Thorndike’s levels of repetition are 
achieved. In none of the present 
studies did any presumed “spread of 
effect” of reward have an opportunity 
to generate the spread gradient; in all 
cases the analysis was based on “spread 
of effect” following chance repeats. A 
result very consistent with the present 
findings has recently been reported by 
Fagan and North (1) who ran Ss indi- 
vidually, apparently did not train their 
Ss in the nature of randomness, and 
prevented their Ss from keeping track 
of their number sequences visually. 
As in the present studies, their gradi- 
ents show a higher level of repetition 
than in the original study (7) of 
“spread of effect” without reward or 
learning. 


It should be noted that in all the present 
studies, and in previous comparable ones (1, 7), 
the analysis procedure involves 100% repetition 
in the “zero” position. In Thorndike’s studies the 
percentage repetition at the zero (“‘rewarded”’) 


position was less than this figure. Thus, the 
present studies, in attempting to reproduce 
Thorndike’s gradients, might yield an even 
higher level of repetition than in Thorndike’s 
studies because of the greater frequency of repe- 
tition at the zero point. However, in the experi- 
ment that most closely reproduced Thorndike’s 
causal factors (see Table 4), the levels are as high 
as those reported by Muenzinger and Dove (5) 
who experimentally induced 100% repetition at 
rewarded points by teaching certain word-number 
associations in advance. 

A final word should be said about the relation 
between the present results and the “probability 
bias” hypothesis offered by Smith (8). Subjects 
have a variety of habits in guessing numbers, 
some idiosyncratic and some fairly general for 
the population (2, 3, 8). Smith’s “probability 
bias” factor—the tendency for Ss to avoid repeat- 
ing the same guess two or more times in a row—is 
a very common tendency noted earlier (3). In 
guessing numbers from 1 to 10, however, it would 
account only for a gradient that starts with 11% 
repeats in Position 1 after and falls off to 10% 
(chance) in Position 5 after. But even this fac- 
tor is subsumed under the hypothesis of habitual 
guessing sequences as the source of spread. As 
Smith notes in his article, a more general state- 
ment of “probability bias” would include any 
tendency on the part of S to fail to guess at 
random. Presumably any failure to guess at 
random is caused by nonrandom habits of guess- 
ing. The factor stressed by Smith is operative 
with almost every S, but is only a small con- 
tributor to the gradient or to the over-all level 
when ten choices are used. The present results 
attempt to get at the more important factors by 
duplicating everything in Thorndike’s procedure 
except the automatic strengthening effect of a 
reward on adjacent connections. The results 
give further support to the interpretation that 
Thorndike’s phenomenon is an artifact of guessing 
habits. 

It should be noted that the distraction factor 
investigated in the present studies and previously 
shown (3) to appreciably raise the level of the 
spread gradient could, under certain circum- 
stances, lower the gradient by interfering with 
guessing sequences. According to the guessing- 
sequence interpretation, the preceding guess 
functions as a stimulus in influencing the follow- 
ing guess, depending on S’s individual number- 
guessing habits. A distraction will facilitate 
sheer habit if the distraction interferes with S’s 
carrying out instructions to reproduce random 
sequences rather than such habitual sequences as 
counting up or down. A highly effective dis- 
traction, however, would also interfere with the 
tendency of a preceding guess, as a stimulus, to 
influence a succeeding guess as a response. Thus, 
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SPREAD OF EFFECT 


under certain conditions of distraction, all guess- 
ing habits might be eliminated as determiners of 
S’s sequences. Apparently, the distractions in 
the main experiments were somewhere between 
these two extremes. In comparing the distrac- 
tions of rehearsal and recall as tasks, however, it 
would appear that the recall task is a more 
optimal condition for the operation of guessing 
habits. 

The fundamental question asked by the pres- 
ent experiments is, of course, whether the guess- 
ing-sequence interpretation (3) of Thorndike’s 
“spread” effect can account entirely for the 
phenomenon. It had been shown (7) that pure 
guessing of numbers produced a Thorndike gra- 
dient without reward or learning, but at a much 
lower over-all level than in the typical Thorndike 
experiment. The present studies can be regarded 
as an attempt to approach Thorndike’s condi- 
tions while at the same time keeping the num- 
ber-guessing task and the rote-learning task 
completely divorced from each other. The ex- 
periments were successful in raising the level of 
repetition to the values reported by Thorndike, 
and to even higher levels, when his conditions 
were reinterpreted from the standpoint of the 
guessing-sequence hypothesis. The chief conclu- 
sion, therefore, is that Thorndike’s explanation of 
his results called attention to the wrong causal 
factors present in his experimental situation. 


SUMMARY 


A number of experiments were per- 
formed to study factors that influence 
the level of repetition in the “spread- 
of-effect” phenomenon. The experi- 
ments tested implications of the guess- 
ing-sequence hypothesis. Some groups 
guessed numbers while concentrating 
on an irrelevant learning task; other 
groups merely guessed numbers with- 
out any distracting task. 

The level of repetition was raised 
by omitting instructions on random- 
ness and opportunity to keep track 
of sequences. Moreover, Thorndike’s 
fixed order of presentation of stimulus 
terms raised the level still further. 
Finally, when the task emphasized 
recall rather than acquisition of the 
learning material, an additional in- 
crease in repetition was found that con- 
siderably exceeded the levels obtained 
by Thorndike. 


107 


The conclusion is drawn that the 
more closely Thorndike’s situation is 
approximated—as reinterpreted from 
thestandpoint of the guessing-sequence 
hypothesis—the higher the level of 
repetition even when reward cannot 
generate the gradients. It appears 
that guessing sequences are sufficient 
to account for Thorndike’s spread 
phenomenon. 


(Manuscript received December 14, 
1951) 
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MASSING AND SPACING PHENOMENA AS FUNCTIONS OF 
PROLONGED AND EXTENDED PRACTICE! 


EDWARD A. BILODEAU 
Perceptual and Motor Skills Research Laboratory, Human Resources Research Center 


The effects of distribution of prac- 
tice over periods of several days have 
hardly been investigated. The lack 
of data is particularly apparent within 
the area of motor behavior. 

Since the concepts of work and 
learning have been described as essen- 
tial for the explanation of motor per- 
formance (4, 5), it has been desirable 
to study the role of each in situations 
where the effect of one can be isolated 
from that of the other or where the 
effect of one is negligible. The decre- 
mental effects of practice have often 
been held to a minimum by extreme 
spacing of practice, while the effects 
of learning can be held to a minimum 
by devising a situation where the ap- 
propriate responses are readily avail- 
able as S begins practice. 

In the present investigation work 
has been varied by the spacing of prac- 
tice and learning has been minimized 
by demanding a relatively simple 
though effortful response. Further, 
the use of a simple response has 
given greater assurance that terminal 
responses are replicates of responses 
initiated early in practice. This desid- 
eratum is not always obtained in 
studies where learning is assumed to 
have taken place. 

The purpose of this study was to 
relate (a) level of responding or rate 
of response early and late within a 


1 The experimental work for this study was 
performed as part of the United States Air 
Force Human Resources Research and De- 
velopment Program. The opinions or conclu- 
sions contained in this report are those of the 
author. They are not to be construed as neces- 
sarily reflecting the views or endorsement of the 
Department of the Air Force. 


practice session, and (b) amounts of 
spontaneous recovery for relatively 
short and long rest periods, to the 
number of previous days of practice 
and to the manner of distribution of 
prior practice under conditions where 
the response is relatively simple but 
effortful. 


MeETHOD 


Subjects.—The Ss of this experiment were 52 
airmen at Lackland Air Force Base who had just 
completed basic training and were awaiting as- 
signment. None of these Ss had received prior 
practice with the test. 

Apparatus.—The apparatus was the manual 
crank described in previous publications (1, 2). 
In brief this apparatus consists of a crank handle 
(with a turning diameter of 9 in.) mounted in the 
horizontal plane at the edge of a table top and a 
tachometer generator for exerting a variable 
braking force upon the crank shaft. The prop- 
erties of the internal system are such that the 
force required to turn the handle at any given 
rate of rotation is known. Although five differ- 
ent force variations were available, only one was 
used for the present investigation. The one em- 
ployed has been previously described as Load 4 
(3). With this load .0410, .0545, .0690, and .0850 
horsepower, for example, are required at 50, 60, 
70, or 80 revolutions per 20 sec., respectively. 
Any of these rates of revolution appears to be 
quite strenuous and leads to reports of great 
muscle fatigue. The number of revolutions per 
20-sec. periods was recorded. 

Procedure.—The Ss were divided equally and 
at random into two groups and instructed how to 
stand before the handle, to use the standard grip, 
and to rotate the handle as fast as possible 
throughout the practice session. 

For both groups, practice consisted of 8 min. 
of cranking on each of ten consecutive weekdays. 
Group 8, the massed-practice group, practiced 
without rest during the entire 8 min. each day. 
Group 4-44, the spaced-practice group, prac- 
ticed continuously for 4 min., rested for 4 min., 
and then finished practice with a second con- 
tinuous 4-min. practice period. 

The Ss of the two groups were run alternately, 
that is, an S of the massed group always followed 
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Fic. 1. 


Mean number of revolutions per successive 20-sec. trials on Days 1 and 10 with massed- 


practice (Group 8) and spaced-practice (Group 4-4-4) 


an S of the spaced group. Each S was run at 
approximately the same hour each day. Before 
each session, all Ss were reminded to crank as fast 
as possible between the “go” and “stop” signals. 
This reminder was not repeated for the spaced- 
practice group, however, during the interpolated 
rest period. Two Ss from Group 8 were lost 
because of scheduling difficulties. 


RESULTS 


The trial means for Day 1 and Day 
10 for each group are presented in 
Fig. 12 The Day 1 means for the 


2 Detailed tables of data from this study are 
available in (2). 


first 12 trials for both groups are 
remarkably similar, 4 min. of contin- 
uous practice being the experimental 
treatment (¢ = .473, with 48 df when 
computed with the scores of Trial 12). 
After Trial 12 and a rest of 4 min., 
Group 4444 spontaneously recovered 
about 24 rev./20 sec., or 93.67% of the 
initial rate of responding. Group 8, 
continuing practice without rest, con- 
tinues to decline throughout the 
remaining 12 trials. On Day 1 the 
spaced group remains above Group 8 
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practice (bottom pair of curves) on each of ten 
days 


during the terminal 4 min. of practice, 
a t significant beyond the .01 level of 
confidence being obtained between 
group scores on Trial 24 (¢ = 2.90, 
with 48 df). 

The comparison of Day 10 with Day 
1 suggests: 


1. The initial rate of responding on 
Day 10 is somewhat less than that for 
Day 1, and there appears little differ- 
ence between the groups attributable 
to the spacing of prior practice. On 
Day 10, Group 444 responded at 
96.03% of its Day 1 level and Group 8 
responded at 94.98% of its Day 1 
level. 

2. After 4 min. of practice on Day 
10 both groups were cranking faster 
than for the comparable point on Day 
1, and there appears to be little dif- 
ference between the groups. When 
the groups were compared by means 
of ¢ on the Trial 12 scores of Day 10, 
no significant difference was obtained 
(t = .375, df = 48). Pooling the data 
of the two groups, however, resulted 
in a t for Day 1 versus Day 10 which 
was significant far beyond the .01 level 


(t = 4.08, df = 48). 
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3. An interaction of trials by days 
(1 and 10) is suggested for Group 8. 
This group cranked at a fairly con- 
stant rate during the terminal portion 
of practice on Day 10. 

4. Group 444 recovers (after a 
4-min. rest) to a level almost equal to 
that of Day 1, but subsequently fails 
to show as rapid a decrement. The 
level to which the spaced-practice 
group recovers after 4-min. rest peri- 
ods thus appears invariant over days. 


Additional evidence for the state- 
ments suggested by the Day 10—Day 1 
comparison above can be obtained by 
plotting the condition means against 
10 successive days of practice, any 
day-by-day trend increasing the credi- 
bility of the summary statements. 

In Fig. 2, the first 20-sec. mean 
score (Trial 1) on each of the ten days 
is shown in the upper portion of the 
figure for both groups. The differ- 
ences between the groups are so small 
and unsystematic that a test of their 
significance was judged unnecessary. 
There appears, however, a slight 
downward trend over days suggesting 
that recovery level over successive 24- 
hr. periods decreases slightly (by 
about 4-5%) but regularly. An anal- 
ysis of between-day variance over day- 
by-subject variance for the pooled 
distributions yielded an F significant 
beyond the .01 level of confidence, 
confirming the downward trend (F 
= 4.57, 9 and 441 df). 

The Trial 24 means have been plotted 
in the lower portion of Fig. 2 for each 
condition. The upward trend of the 
data for both groups suggests that the 
terminal rate of responding is an in- 
creasing function of days of practice. 
This trend, too, appears to be real 
since the F (between-day variance 
over day-by-subject variance) was sig- 
nificant beyond the .01 level for each 
group separately (F = 9.26 for Group 
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Fic. 3. Mean gain in rate of responding after rest 
as a function of successive recovery periods 


8, and 4.53 for Group 44-4, df = 9 
and 207, and 9 and 225, respectively). 
The spacing of practice treatment 
has resulted in a slightly, but signifi- 
cantly, higher terminal level (Trial 24) 
of responding for the group with a 
4-min. rest, for, as noted previously, 
the difference between Trial 24 means 
is significant on Day 1. The differ- 
ence between the groups, however, is 
nowhere as great as on this day. 
Since (a) response rate during the 
middle and terminal portions of the 
practice session tends to increase over 
successive days, and (b) level to which 
recovered as a function of 24-hr. rest 
decreases slightly, it follows that the 
amount of 24-hr. spontaneous recovery 
for the spaced- and massed-practice 
groups should decrease as a function of 
days.* The mean 24-hr. spontaneous 
recovery, for successive days is plotted 


3It should be understood that 24 hr. of rest 
were interpolated between successive days of 
practice except for one 72-hr. rest between the 
fifth and sixth days of practice. Thus, the refer- 
ence 24-hr. rest is not always appropriate. 


in Fig.3. The mean day-to-day recov- 
ery after 4 min. of rest for Group 
444 is shown in the lower portion of 
Fig. 3. 

Although the curves are quite irreg- 
ular, the downward trends of all three 
functions are nevertheless apparent. 
Some of the irregularity of each of the 
three functions appears associated 
with the weekend of rest (72 hr.), for 
the weekend precedes or corresponds 
to the fifth recovery period for the two 
24-hr. curves and the sixth for the 
4-min. curve. The negative slope to 
the 4-min. recovery function follows 
from (a) above, since the level of recov- 
ery after 4-min. rest remains constant 
over days. The F based upon the 
day-by-day Trial 13 means for be- 
tween-day variance over day-by-sub- 
ject variance failed to reach the .05 
level of confidence with 9 and 225 df 
(F = 1.12). 


Discussion 


Hull (4) has assumed that the 
amount of reactive inhibition (JR) is 
an increasing function of the number 
of prior responses per unit time. If 
the decrements in rate of cranking can 
be attributed to reactive inhibition, 
then a number of the present findings 
can be used to test some of the impli- 
cations of Jz theory. 

Hull has further suggested that with 
the dissipation of Jz the resting re- 
sponse (s/z) is strengthened and that 
the resting response be given the prop- 
erties of habit.(sHr). Some of these 
properties of Jp and slr have been 
tested by Kirable (5) and in general 
the results have tended to verify many 
of the expectations concerning the two 
response-inferred variables. 


In order to test or discover some of the prop- 
erties of Jz and sl/r by means of the present data 
it is necessary to assume that the cranking habit 
(sHr as distinguished from sER) does not in- 
crease in strength as a function of number of 
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previous responses. It seems reasonable to make 
this assumption at present, yet it should be clear 
that it is made tentatively. 

If the resting response is strengthened during 
rest as Iz is dissipated, the initial rate of respond- 
ing should decrease as a function of the number of 
days of practice. The results confirm this pre- 
diction, yet the magnitude of the decrease on the 
tenth day of practice on an extremely effortful 
task was surprisingly low. There are, of course, 
the possibilities that the decremental effect was 
almost completely overcome by an opposed in- 
crease in (a) the strength of the cranking habit 
through learning, or (6) response through in- 
creased strength of the musculature. The finding 
that the rate of responding during the middle and 
terminal portions of practice increased as a func- 
tion of days suggests that (b) is probably correct. 
This can be interpreted as the growth of muscular 
strength quite apart from increasing response 
strength through learning. With this interpre- 
tation it is easier to understand why the differ- 
ence in initial response rate on Day 10 versus 
Day 1 was not larger. 

Two other findings of importance are that (a) 
the level of responding on Trial 1 on each day for 
the spaced group was remarkably similar to that 
for the massed group, and (b) the spaced group 
recovered to the same level of responding after 
four minutes of rest on each day of practice. 
Since Group 44-4 both responded more often 
and rested more often than the massed group, 
more opportunity was afforded Jz to dissipate. 
Consequently, the spaced group might be ex- 
pected to generate more sz. Neither (a) nor 
(b) above, however, suggests that more sJz has 
been accrued by the spaced-practice group. In 
contrasting these two groups the argument that 
augmented muscle strength and/or learning 
equaled a growing and permanent resting response 
(decrement) can still be advanced. This seems 
unlikely or at least quite unparsimonious for, if 
true, both the positive and negative reaction 
tendencies must grow at rates such that their 
magnitudes cancel when added and this must 
obtain on each of the ten days of practice. 

The interpretation of the massing and spacing 
data is, of course, limited by the range of dis- 
tribution chosen for study. Nevertheless, the 
range of inquiry has not been markedly restricted. 
It seems reasonable for the present to conclude 
that some distributions of practice phenomena 
are transitory and temporary in character. The 
results of this investigation have shown little 
differential effect ascribable to the manner of dis- 
tribution of practice on preceding days. This, 
too, seems to be characteristic of results found 
with motor Jearning tasks (6). 

The results have indicated systematic differ- 
ences in performance as a function of number of 
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days of practice. Day-by-day differences appear 
ascribable to an increasing capacity to do more 
work over a practice session—a point of view 
which might be interpreted as an increase in 
threshold tolerance for accumulated Jz or an 
increase in the strength of the muscles of the 
responding body member. 

These results have indicated further limita- 
tions of the properties ascribed to s/z as currently 
proposed in the interpretation of motor behavior, 
particularly in a situation where little or no 
learning may be assumed to have taken place. 


SUMMARY 


Two groups of Ss practiced cranking 
as fast as possible for 8 min. on each of 
ten successive weekdays. For the 
spaced-practice group a rest of 4 min. 
was interpolated between two practice 
periods of 4min. The massed-practice 
group cranked continuously for the 
entire 8 min. 

The data were presented as mean 
number of revolutions per successive 
20-sec. scoring periods on each day of 
practice. 

Continuous cranking led to marked 
decrements in rate of response during 
each experimental session. The major 
findings are listed below. 


1. The initial rate of responding for 
each day decreased significantly but 
by a small amount as a function of 
days of practice. There appeared to 
be no difference as a function of spac- 
ing of prior practice. 

2. Rate of responding at 4 or 8 min. 
was an increasing function of days of 
practice. The difference between the 
groups at 8 min. was attributable to 
the preceding 4-min. rest period. 

3. The level to which the spaced- 
practice group recovered after 4 min. 
of rest remained invariant over days. 

4. The amount of 24-hr. spontane- 
ous recovery decreased as a function of 
days for both massed- and spaced- 
practice groups. The difference be- 


tween the groups was related to the 
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difference in terminal rate of respond- 
ing of the preceding day. 

5. The 4-min. spontaneous recovery 
for the spaced-practice group decreased 
as a function of days. This decrease 
appears related to the increasing 
response rate immediately prior to 
rest. 

6. For the extremes of distribution 
of practice represented by the two 
groups it can be concluded that little 
or no difference in rate of responding 
over a number of days can be attrib- 
uted to the spacing of practice on pre- 
ceding days. Differences in perform- 
ance over a succession of days appear 
to be influenced more by the number 
of days of prior practice than by dis- 
tribution of this practice. 


(Manuscript received December 29, 
1951) 
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A NEW METHOD FOR ANALYZING PRINTED ENGLISH ! 


EDWIN B. NEWMAN AND LOUIS J. GERSTMAN ? 


Harvard University 


Language, as a form of human behav- 
ior, has long interested psychologists. 
Its scientific study has employed all of 
the methods which could be borrowed 
from other academic disciplines, as 
well as a fair assortment of the psy- 
chologist’s own stock-in-trade. And 
yet language is so: complex that even 
our most powerful methods of anal- 
ysis do not take us far. 

In many respects analysis at the 
level of words would be the most use- 
ful. Words come closest, at least, to 
the layman’s idea of what are the 
natural units of speech and communi- 
cation. But the difficulties are very 
great because the number of words is 
large and it is not easy to get homoge- 
neous samples of sufficient size. It is 
understandable, therefore, that the 
first application of new tools should be 
made at the level of letters rather than 
words. 

The present paper contains certain 
proposals for a new method which may 
prove to be both practical and fairly 
useful in estimating how much struc- 
ture or pattern a sample of language 
possesses. For the reasons given 
above, the application of the method 
has been limited to sequences of letters 
in ordinary text. In addition to 
describing the method, the results of 
an analysis of printed English are 
reported since the argument for the 
method depends largely upon the dem- 
onstration that it works in practice. 


1 This study has been carried out with assist- 
ance from the Human Resources Research Lab- 
oratory under contract AF33(038)14343, Psy- 
chological Laboratory Report PLR-2. 

2Now at Haskins Laboratory, New York, 
i # 


INFORMATION IN PRINTED ENGLISH 


The point of departure for this study is the 
estimation of the “information” contained in 
printed English by Shannon (6, 7,8). The term 
“information” comes to have a quite specific and 
technical meaning. In general, a message such 
as a sequence of printed symbols contains infor- 
mation to the extent that it directs the receiver 
of the message to choose one of a specified set of 
alternatives, whether they be objects, or actions, 
or more complex meanings. Each of the letters 
of our printed alphabet might, if we made the 
best possible use of it, direct the reader to one of 
26 equally probable alternatives, and a series of n 
letters could designate 26" alternatives. 

In practice, we know that the use of letters is 
very far from optimal. For one thing, they were 
originally ‘devised to represent the sounds of 
speech, not objects or ideas. For another, they 
are not used with equal frequency, and there are 
many combinations of letters which never occur 
in the normal use of language. Note, for example, 
the empty cells in Table 1 below. It becomes 
necessary to estimate how severe are these restric- 
tions and over how long a sequence their effects 
are felt. The greater is the extent of these con- 
straints, the less will be the information con- 
veyed by a message of any given length. 

The straightforward approach to the problem 
would require a tabulation of all of the sequences 
of letters which occur in the sample of language 
in which we are interested, and from this, an 
estimation of the probability which would attach 
to each such sequence. If we are content with 
single letters, with sequences of two or of three 
letters, this kind of tabulation can be, and has 
been, done. But the work required goes up in 
geometric proportions. There are roughly a 
hundred million million possible arrangements of 
10-letter sequences, so that it is evidently impos- 
sible to extend this type of analysis to series of 
this length. Clearly, the direct, frontal approach 
must be given up. 

Shannon has recently employed an interesting 
variant of an old game in order to gain some esti- 
mate of the information contained in longer series 
(7, p. 54). The device employs a kind of rever- 
sal, a dual, of the usual communication link. In 
place of transmitted information, the store of a 
subject’s verbal habits is drawn upon. He is 
given a series of letters from any convenient text 
and is asked to guess what is the next letter. He 
is either corrected and goes on, or he continues 


114 














METHOD FOR ANALYZING PRINTED ENGLISH 115 


to guess until the actual letter is found. In 
either case, the extent to which the guessing is 
better than chance indicates the extent to which 
the subject is constrained by his built-in verbal 
habits. Were he able to guess perfectly, the new 
letter could convey no new information. If his 
guess were random, the new letter could bring 
maximum information. From a limited experi- 
ment conducted in this manner, Shannon has 
constructed a curve showing the upper bound on 
the information added by each additional letter 
in a chain constrained by the previous (n-1) 
letters. 

There is an alternative procedure with which 
it is possible to test for dependencies over longer 
spans. This is the autocorrelation function 
which one of us has recently applied to sequences 
of vowels and consonants (4). By calculating 
the sum of the cross-products of all pairs of values 
in a time series which are separated by any given 
interval, a measure is obtained which expresses, 
in our present terminology, the dependence of 
the second value on the first. This is not the 
place to discuss the relative merits of these two 
basically different approaches, information and 
autocorrelation, for the problem of communica- 
tion. Two differences only are important for 
our present problem. First, with finite series 
the autocorrelation function throws away certain 
information that is retained in the informational 
analysis. This is a result of disregarding the 
intermediate values in a series and taking the 
cross-products across an unspecified gap. On 
the other hand, this is not as serious as it first 
seems, for, as Elias has recently pointed out, the 
autocorrelation sets an upper bound on the infor- 
mation in a time series (1). 

There is a second and more serious difficulty 
that cannot be surmounted soeasily. This is the 
clearly nonmetric character of the letters of the 
alphabet. The autocorrelation function requires 
that the values in the time series can be placed 
on some kind of a scale, usually a full-fledged 
ratio scale. It is hard to conceive any way in 
which letters can be ordered such that their 
“cross-product” would be meaningful. The only 
thing we dan do is to map them into a kind of 
multidimensional probability space in which 
there are as many dimensions as letters. Such a 
model gains us nothing. 


Tue CoeEFFICIENT OF CONSTRAINT 


In the absence of any practical sta- 
tistic to use in analyzing sequences of 
letters, we have turned to an alter- 
native measure which will be called 
the coefficient of constraint. The co- 


efficient of constraint is a measure 
which behaves roughly in the same 
way as does the coefficient of correla- 
tion, that is, it has a value of zero 
when there is independence of the two 
measures, and a value of one when 
there is complete dependence.’ It dif- 
fers from the coefficient of correlation 
in that it may have only positive 
values. To give some intuitive notion 
of what is involved, we shall first 
describe the procedure which was 
followed. 

The basic step consists of the con- 
struction of a set of contingency 
tables, one for each step of displace- 
ment, much in the same way that 
scatter diagrams might be constructed 
for the autocorrelation function. Each 
table has 28 X 28 cells (26 letters plus 
space and period). The first table 
shows the frequency with which each 
letter is followed immediately by each 
other letter. The next table contains 
the letters following a given letter 
skipping over one, as AxA, AxB, etc. 
The third table skips two letters, thus, 
AxxA, AxxB, and so on for the fourth 
and fifth tables. 

The measure which is proposed 
describes the additional information in 
the second of the pair of letters com- 
pared with the information which that 
second letter might contain if it 
occurred alone. This is already a 
familiar computation, for it is iden- 
tical with the way in which a measure 
of redundancy is found. 


3 As is nearly always the case, the measure 
which is proposed here bears a generic relation- 
ship to certain alternative expressions. Shannon 
and others have defined a “distance” in probabil- 
ity space which is the converse of our “constraint” 
(Cybernetics Symposium, AAAS meeting, New 
York, December 28, 1949). Cf. also, Parke and 
Lamson (5) who casually use the term “coeffi- 
cient of correlation” much as we use “coefficient 
of constraint.” It is probable that there are 
others. No attempt has been made to survey 
the literature exhaustively. 
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To put the matter in more formal 
terms, the information contained in 
the single letters, assuming that they 
are independent of one another, is 
given by: 

a 

H = -¥ pli) logsp), (1) 
in which H is the measure of informa- 
tion, expressed in bits, and (i) is the 
empirical probability associated with 
one of the a alternatives. The infor- 
mation contained in single letters is 
easily computed by the application of 
this formula to the empirical probabili- 
ties in the sample under consideration. 

Each of the contingency tables may 
be treated in the same way. If the 
two selections, that is, for rows and 
columns, have been made independ- 
ently, the table will contain twice the 
information found for the single let- 
ters. The actual information is cal- 
culated as before: 


H(1,2) = -ZE p(éj) logep(ixf). (2) 


The expression, H(1,2), is read to 
mean the information contained in 
pairs of adjacent items in a sequence. 
From this it follows that the informa- 
tion added by the second letter of the 
pair is measured by the difference 
between these two estimates, or: 


H,(2) = H(1,2) -H(1), (3) 


which may be read, the information in 
item (2) when (1) is given is equal to 
the total information contained in the 
pairs (1,2) minus the information in 
(1) alone. The more general form of 
this expression would be: 


H,(n) = H(1,n) — H(1). (3a) 
It is now possible to define the co- 
efficient of constraint, D, as: 


H,(n) 


D(n) = 1- H(i)’ (4) 
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In general, zero constraint will occur 
when the nth selection is independent 
of the first selection, or: 


H,(n) = H(1). (4a) 


Similarly, the maximum value would 
be given if each column were uniquely 
determined by each row, in which 
case: 

H,(n) = 0, (4b) 


and D would become unity. It should 
also be noted in passing that D(2) is 
identical with the usual measure of 
relative redundancy for the second let- 
ter in a sequence of two letters. 


MEASURED CONSTRAINT IN 
PRINTED ENGLISH 


As a test of the usefulness of our 
method we have examined a 10,000- 
letter sample of printed English. The 
text used was taken from the Bible 
and consisted of the larger part of 
Isaiah xxix—xxxi in the King James 
version. The material was punched 
into Teletype tape using the 26 letters 
plus one symbol for space and a 
second symbol for period. No space 
was used following a period, and other 
punctuation was disregarded. The 
contingency tables were constructed 
by running the tape through our pres- 
ent Autocorrelator and Sequence Tab- 
ulator, a device that is a substantial 
elaboration of the device that has been 
previously described (3). However, 
even with the present machine, 28 runs 
were required for each table so that 
the total number of operations for the 
six tables here reported ran to about 
1,500,000. This was more than enough 
to test the adequacy of our mainte- 
nance procedures! 

Six contingency tables were con- 
structed by these methods. They 


were for steps 1-2, 1-3, 1-4, 1-5, 1-6, 
and 1-10. Only two of the tables are 
presented here, Table 1 for the first 
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step, 1-2, and Table 2 for the last one, 
1-10. Table 1 possesses some interest 
in itself for it gives in detail a tabula- 
tion of the digraphs in this sample. 
Since space and period are included, 
the table also shows, on a restricted 
sample, what is the distribution of 
letters which begin words and end 
words, and begin sentences and end 
sentences. The counts for words are 
obtained by combining space and pe- 
riod, of course. Moreover, there are 
1977 spaces and periods, and hence 
1978 words in the sample, and 120 
sentences. This gives an average 
word length (not counting the space) 
of just over four letters, and a sentence 
length of 16.5 words. 

The results of the entire calculation 
are summarized in Table 3. It gives 
in the first column H(1) (the same for 
all steps), in the second column the 
dependent “information,” H,(n), and 
in the third column the computed co- 
efficient of constraint, D(n). 

A rather striking regularity emerges 
when the values of D are plotted on 
logarithmic scales as in Fig. 1. In 
this plot the abscissa is the step that 
is compared with step (1). The first 
point must represent a value of D 
equal to unity by definition. The 
first calculated point is plotted above 
the value (2), and so on for the other 
five values. The straight line which 
has been drawn has a slope of —2.0. 


TABLE 3 


CoErFIciENT oF Constraint (D) ror Six 
STeps IN THE SEQUENTIAL ANALYSIS 
or Printep Encuisn Text 











Step A(1) Ai(n) D(n) 
1-2 4.08 bits} 3.17 bits} .223 
1-3 4.08 3.66 -103 
1-4 4.08 3.82 064 
1-5 4.08 3.92 039 
1-6 4.08 3.97 027 
1-10 4.08 4.03 012 
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It is obvious that the straight line is 
an excellent approximation to the 
entire set of observed points and, with 
the possible exception of the second 
point, there is no significant deviation. 

If it is assumed that this straight 
line adequately describes the results, 
it is possible to sum them up in even 
more simple fashion by writing the 
empirical equation: 


D(n) = 1/n?. (5) 


As we shall show later, the rather 
simple form of this equation leads to 
an important simplification of the esti- 
mated information in a long sequence 
of symbols. 


A New EstTIMATE OF THE UPPER 
Bounp oF INFORMATION IN 
PRINTED ENGLISH 


With these empirical results in hand, 
we are emboldened to propose a new 
estimate of the amount of information 
in continuous text. In the actual use 
of language, the constraints which 
operate invariably extend over se- 
quences much longer than one or two 
letters. What we are interested in 
discovering is the information con- 
tained in each letter when the series is, 
for all practical purposes, infinitely 
long. Presumably, as the transitional 
probability depends on a longer and 
longer sequence, the information added 
by the final letter in the sequence 
approaches the average information 
over the entire sequence. Using the 
symbol F(n) for the information in the 
last letter of a sequence of (m) letters, 
it can be shown that: 


F(n) = Hi jk... na (n) 


— H(i,j,k...n)/n. (6) 


Since it is quite impractical to eval- 
uate either the successive values of 
F(n) or the limiting value represented 
by the right-hand side of the expres- 
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Fic. 1. The constraint exercised by one letter 
in printed English on a second letter following it 
after a varying interval. The interval is meas- 
ured by the number of letters in the sequence, the 
first or constraining letter being number one. 
Both n and D(n), the coefficient of constraint, 
are plotted on logarithmic scales. 


sion, some alternative must be found. 
Shannon has worked out one approach 
by deriving both an upper and a lower 
bound from the results of the guessing 
game. The reader is referred to 
Shannon’s paper for the details of the 
method (7). 

As a second alternative, we are pro- 
posing that the coefficient of constraint 
be regarded as a proportion by which 
the amount of information per letter is 
successively reduced as the constrain- 
ing sequence is made longer and longer. 
Using the term F(n) as defined in 
equation (6) above, that is, as the 
additional information added by the 
nth letter in a sequence of n letters, 
we would write: 


F(n) S (1—D(n))-F(m — 1). (7) 


It is possible to show that this relation 
holds in several limiting cases. For 


one thing, the equality holds in the 
case of the first step in the sequence. 
Since D(2) is the relative redundancy 
and F(1) is the information in the first 
letter of a sequence, it follows as a 
matter of definition that: 


F(2) = (1—D(2))-F(1). (7a) 


Furthermore, we may define the be- 
havior of both measures if it is assumed 
that any item is independent of the 
nth preceding item. In this case we 
have the known limiting equality 


F(n) = F(n—1). (8) 


At the same time, we have by defini- 
tion zero constraint so that equation 
(7) reduces to equation (8). Finally, 
if D(n) should be unity, equation (7) 
says that F(n) becomes zero. This 
outcome is intuitively correct and we 
omit the somewhat tedious demon- 
stration that F(n) should indeed be- 
have this way. In general, however, 
no proof of equation (7) has been 
found and there are limiting cases in 
which it is proved not to apply. 

If it is granted for the moment that 
the argument is at least intuitively 
correct, there are then two methods of 
procedure open tous. The first would 


TABLE 4 


Comparison oF Various EsTIMATES OF THE 
AppiTIONAL Inrormation, F(n), ADDED By 
THE nTH LETTER IN A SEQUENCE 











Shannon 
step | Shannon, | Smoohed | Present 
1 4.03 4.05 4.08 
2 3.42 3.45 3.17 
3 3.0 3.0 2.84 
4 2.6 2.7 2.66 
5 2.7 2.5 2.56 
6 2.2 2.3 2.49 
7 2.8 2.2 2.44* 
8 1.8 2.1 2.40* 
9 1.9 2.0 aar” 
10 2.1 2.0 2.34 
100 1.3 1.9(?) 2.06 














* By interpolation from Fig. 1. 
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be the direct computation of the suc- 
cessive values of F(n). These are 
given in Table 4 together with a com- 
parison of the values read from Shan- 
non’s smoothed curve, and his observed 
values. It is reasonably clear that the 
agreement is fairly good although it is 
not possible to give any adequate esti- 
mate of how good since there is no 
alternative view against which to test 
it. 

The second possibility arises from 
the fact that it was possible to derive 
an empirical equation (5) for D(n) 
above. Let us note first that equa- 
tion (7) may be expanded as follows: 


F(n) S F(1)-(1—D(2)) 
(1—D(3)) --- (1-D(m))._ 9) 


Substituting, according to equation 
(5) and replacing F(1) with H(1), we 
get: 


F(n) = H(1) (1-1/2) 
-(1—1/32) «- » (1—1/n%). (10) 


By rearranging terms, this becomes, 
successively, 


F(n) = H(1) [(2?—1)/2?] 

-[(3?—1)/3?] --+ [(n?—1)/n*] (10a) 

< H(1) [3-4-5 +++ (m+1)] 
[22-32-42 --~ nt] (10b) 


= H(1) (n+1)/2n. (11) 


An examination of this equation 
shows that it has several interesting 
properties. In the first place, it has a 
real limit, for as n approaches infinity, 
the right side of the equation (11) 
approaches H(1)/2. This limit is ex- 
actly the value which we wanted above 
in equation (6). To be sure we believe 
that we have here only an upper bound 
to such a limit, but the available evi- 
dence suggests that the actual values 


do not fall too far below such an upper 
bound. 
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A second property of the equation 
is the fact that it rises to plus infinity 
for n = 0. It will be recalled that it 
has been customary to calculate a 
value for H(O) by assuming that each 
of the 26 letters was represented with 
equal probability. Actually, a little 
reflection will show how arbitrary has 
been this assumption. It is just as 
possible to assume that the sender has 
a very large set of letters from which 
he might have drawn, and for which 
we can assume with equal justice an 
equivalent share of the total proba- 
bility. The only thing we know is 
that he uses only 26 of these letters 
with a finite probability at the first 
(empirical) step. An extension of 
this reasoning leads rather easily to 
the acceptance of the notion of infinite 
information at the zero step, if indeed 
any meaning at all is to be attached to 
values of nm less than 1. The behavior 
of the function is consistent with this 
line of reasoning. 

Finally, it becomes interesting to 
see whether this equation is particular 
to the form which equation (5) hap- 
pened to take. A more general form 
of this equation might be: 


D(n) = 1/n’, (12) 


in which b is an arbitrary slope con- 
stant from a plot such as Fig. 1. 
Except for the integral values of b = 1 
and b = 2, there appears to be no gen- 
eral solution for the form of the func- 
tion describing F(n). Much more 
empirical evidence will be required 
before any great generality can be 
claimed for either equation (11) or 
equation (12). 

The best summary of the results of 
this development is the plot in Fig. 2. 
In this are drawn Shannon’s curve for 
the upper bound (dashed line) and our 
function according to equation (11) 
(solid line). Further, there are plotted 
the empirical values taken from Shan- 
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Fic. 2. The information added by the final letter in a sequence n letters long. The figure 
shows in comparison both the experimental points (open circles) and the smoothed curve 
(dashed line) from Shannon (7) and the calculated points (solid circles) and computed function (con- 
tinuous line) from the present paper. Our function has an asymptote at 2.04 bits. 


non’s article (open circles) and our 
results when calculated by the ex- 
panded form of equation (9) (solid 
circles). Within the range for which 
there are empirical data, there is no 
clear choice between the two functions. 
Extrapolation should be made with 
caution. 


ConFIDENCE Limits oF INFORMATION 
EsTIMATES 


It is not unnatural that the psychol- 
ogist should be particularly cautious 
about the confidence limits which 


attach to any extended observations 
of this sort. He has been trained 
always to inquire about the magnitude 
of sampling errors. Unfortunately, 
we know of no general solution to this 
problem for information measures, so 
that it seemed worth while to resort to 
somewhat more primitive methods. 
The obvious method of attack was by 
way of x? since the basic tables are 
effectively contingency tables. The 
following results are reported for what 
preliminary assistance they may be. 
As a first step we computed both 
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D(n) and x? for six sets of reduced 
tables formed by combining rows and 
combining columns of the large tables. 
Thus, we could form a 3 X 3 table in 
which the classification was Vowel- 
Consonant-Space. These reduced 
tables were 2 X 2, 3 X 3, and 4X4 
in size. In every case, we found for a 
given sample and a given set of mar- 
ginal totals that there was an almost 
precise linear relation between the two 
measures. There are several param- 
eters of this relation at which we can 
guess roughly. One is the total N of 
the sample. D(n) is entirely inde- 
pendent of N (except for sampling 
variation) whereas x? is proportional 
to N. Furthermore, x? is apparently 
proportional to H(1) (or to log a, 
where a is the number of categories). 

Encouraged by this evidence, we 
have worked out the x? for a 10 X 10 
table derived from Table 2 above. 
The reduction was accomplished prin- 
cipally by combining some of the less 
frequently used letters. For this table, 
x’ equals 121 which gives a p = .004. 
Thus this 10 X 10 table shows a sig- 
nificant constraint, but by extra- 
polating it can be estimated that the 
full table for the 1 to 10 step probably 
does not differ significantly from 
chance. The full table for Step 6 
would, on the other hand, certainly 
prove highly significant. Roughly it 
can be said that there are probably 
significant dependencies contained 
within the table for Step 10, but that a 
judicious and, one hopes, legitimate 
combination of columns is necessary 
in order to bring them out. 


INFORMATION AND LINGUISTIC 
CLASSIFICATION 


It is not possible to make a study 
such as this without becoming curious 
about the relation of this estimate of 
the structure of printed language to 
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other studies of lexical structure. 
Some of these relations are worth brief 
comment here. 

One expectation which we had in 
making the study was not fulfilled. 
This was that there would appear a 
rather sharp drop in the coefficient of 
constraint about the fifth or sixth 
letter. This was based on the premise 
that the sequential dependencies con- 
necting words would be ever so much 
weaker than the dependencies within 
words, and that the fifth and sixth 
letters would represent the beginning 
of new words. On the average, this 
appears not to be true. Perhaps the 
reason is that the break is blurred by 
various sorts of statistical variability. 
Words are not all the same length. 
There are some very high sequential 
dependencies among some words. 
Other explanations will occur to the 
reader. But the basic fact remains 
that there is very little evidence here 
that the order of magnitude of this 
dependency is different for words than 
for letters. If it were, we would have 
every reason to expect to find some 
sharp changes in slope somewhere 
about the length of the median word. 

A second line of evidence seems to 
confirm this view. If our data are 
used to compute the amount of infor- 
mation per word, an estimate of 10.2 
bits per word is obtained. This is 
simply the average word length, 5 
letters, multiplied by the limiting 
value of information per letter, 2.04 
bits. This would predict a vocabulary 
of about 1300 equally used words. 
But we know with sme certainty that 
words are not used equally so that the 
vocabulary would hae to be much 
larger. Actually, both Shannon (7) 
and Miller (2, p. 798) have independ- 
ently taken the well-knowa harmonic 
function of Zipf (9) and calcithated the 
presumed information per wordfrom 
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the statistics of word use alone.‘ This 
gives estimates between 9.7 and 11.8 
bits per word. Since we have not 
made any word counts on the material 
used here, it would appear that our 
estimate, based on letters, is as close 
as casual sampling would predict. 

One final comparison has been made. 
By taking the sequential probabilities 
represented in Table 2, it is possible to 
compare the constraint exercised by 
letters within a word with the con- 
straint on the initial letter of each 
word. This calculation shows that H 
for initial letters in our sample is 4.00 
bits, only slightly less than the 4.08 
for all letters and spaces taken singly, 
and almost identical with the value of 
4.01 calculated for single letters alone 
in this sample. Put in other terms, 
although the distribution of initial 
letters is quite different from distri- 
bution of all letters, knowledge of this 
distribution places no greater con- 
traint on the choice of a letter than 
does the choice of a letter at random 
from the total sample. This is in sharp 
contrast to the constraint implied by 
the remainder of Table 1. With the 
initial letters removed, the H is 
reduced from 3.17 to 2.96 bits per 
letter. It should be emphasized that 
in this paragraph, the constraint being 
discussed depends upon the single pre- 
ceding letter. Our tables do not per- 
mit further analysis. 

The consequence of this last finding 


‘ There seems to be some disagreement about 
the exact numerical value to be expected. 
Shannon says that the sum of the first 8727 ranks 
gives a total probability equal to unity. Miller, 
presumably using the definite integral, comes out 
with 22,000. Our own calculation, which takes 
into account the discontinuous values for the 
first 100 ranks, gives 12,370. On the other hand, 
Shannon gets a figure of 11.82 bits per word, 
Miller gives 10.6 bits per word, and our smaller n 
only gets this estimate up to 9.7 bits per word. 
The reason for our disagreement with Shannon 
is not clear. 


must be taken together with the fact 
that the space itself is largely redun- 
dant. We know of no numerical re- 
sults to demonstrate this, but the most 
casual inspection of guessing patterns 
will show that this is true. What is 
evidently happening, then, is that the 
space itself contributes little infor- 
mation, but the space plus the initial 
letter, when taken together, supply 
almost as much information as the 
average pair of letters. 

In conclusion it can only be said 
that the present tools when applied to 
letters are not adequate to sense the 
somewhat subtle sequential depen- 
dencies among words. Actually the 
striking fact is that the constraints on 
letters are so much more marked than 
those among words. It seems prob- 
able that questions of grammatical 
structure and the like must be tackled 
directly. 


SUMMARY 


1. A new measure, the coefficient of 
constraint, is proposed which provides 
a limited but extremely useful esti- 
mate of the sequential dependencies in 
sequences of events, such as letters, 
which cannot be ordered into a metric 
scale. 

2. When the coefficient of constraint 
was calculated for a passage of printed 
English, a quite regular behavior of 
the coefficient was discovered which 
could be described by the expression 
D(n) = 1/n?. 

3. A possible use of the coefficient of 
constraint is proposed by which an 
upper bound is set on the average 
information per letter in the text exam- 
ined. This upper bound has both con- 
siderable resemblance to, and a few 
striking differences from, Shannon’s 
estimate of the same function. 

4. Neither greater freedom nor 
greater constraint is discovered when 
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sequences of letters are examined of a 
length greater than that of one word. 


(Manuscript received for priority 
publication March 13, 1952) 
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EXPERIMENTS ON SENSORY-TONIC FIELD THEORY OF 
PERCEPTION: V. EFFECT OF BODY STATUS 
ON THE KINESTHETIC PERCEPTION 
OF VERTICALITY # 


SEYMOUR WAPNER AND HEINZ WERNER 
Clark University 


The previous papers of this series 
(1, 2, 4, 6) were concerned with organ- 
ismic effects on the visual perception 
of verticality. It seems important to 
extend the experiments beyond visual 
perception for the following reason. 
It is necessary to inquire into the gen- 
erality of organismic effects on objects 
independent of the modality in which 
they are perceived (3). Furthermore, 
the question arises whether specific 
peripheral factors, such as retinal dis- 
placement due to eye movements, etc., 
are capable of accounting for the effects 
on visual perception previously re- 
ported; or whether general organismic 
factors, irrespective of the involve- 
ment of peripheral factors, play a 
significant role. Consequently, exper- 
iments were carried out on the effect 
of “extraneous stimulation,” i.e., stim- 
ulation from a source other than the 
object, on the perception of verticality 
as experienced kinesthetically. 

In the present paper the results of 
three experiments on the kinesthetic 
perception of verticality are reported: 
(a) effect of head tilt; (b) effect of body 
tilt; (c) effect of concentric body 
rotation. 


APPARATUS AND PROCEDURE 


In all three experiments, the task for S, while 
blindfolded, was to adjust a rod to the vertical 
kinesthetically. The rod was 31 in. long; its 
lower end was pivoted at S’s waist level so that 
it rotated in the fronto-parallel plane. The angle 


1 This investigation was supported by a grant 
from the National Institute of Mental Health of 
the National Institutes of Health, Public Health 
Service. 


of inclination to which S adjusted the rod was 
directly read from a protractor. The S grasped 
the rod with thumb and finger tips of both hands; 
he then had to move both hands simultaneously 
up and down the rod, and to change its angle of 
tilt until it appeared vertical to him. 

Experiment I: Sidewise head tilt—In this 
experiment the task was carried out under three 
conditions of head position: (a) sidewise tilt of 
head toward the left shoulder at an angle of 45°; 
(b) head upright; (c) sidewise tilt of head toward 
the right shoulder at an angle of 45°. For each 
of these main conditions two trials were given 
with the rod initially set in each of three starting 
positions: 30° Left; 0° (plumb line); and 30° 
Right. Thus a total of 18 trials was given to 
each S. Twelve men and 12 women served as Ss. 
The presentation of trials was randomized, and 
the data were analyzed as a three variable (head 
position, starting position, Ss) factorial design 
separately for men and women. 

Experiment II: Sidewise body tilt—In this 
experiment the task was carried out under three 
conditions of body position: (a) body tilted 30° 
L; (6) body upright; (c) body tilted 30° R. For 
the conditions of body tilt S rested his body, 
shoulder, and head against a leaning board of 
inverted Y shape, cut out at the shoulders to 
permit free movement of the arms. Here, as in 
the first experiment, for each main condition 
three starting positions were used: 30° L; 0° 
(plumb line); and 30° R. Two trials were given 
for each combination of conditions making a 
total of 18 trials. The presentation was in ran- 
dom order. Twenty-one men served as Ss. 
The data were analyzed as a three variable (body 
tilt, starting position, Ss) factorial design. 

Experiment III: Body rotation.—In this exper- 
iment the task was carried out under conditions 
of concentric body rotation. Labyrinthian stim- 
ulation was provided by a rotating chair de- 
scribed previously (2). Rotation was controlled 
by a constant speed motor through a Graham 
variable speed drive. One speed, 25 rpm, was 
used for both clockwise and counterclockwise 
motion. 

The period of acceleration, from zero to a 
constant speed of 25 rpm, lasted 15 sec.; decelera- 
tion, from constant speed of 25 rpm to 0 rpm, 
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TABLE 1 
Mean Position or APPARENT VERTICAL* 
Head Position 
Sex 
Left Tilt Upright Right Tilt 
Men +7.7° —1.0° —10.8° 
Women +6.4° —1.4° —9,3° 














* The plus sign indicates the position of apparent 
vertical clockwise of the plumb line (as viewed by S); 
the minus sign indicates the position of apparent ver- 
tical counterclockwise of the plumb line. 


lasted 9 sec. The head was tipped forward so 
that a line from the corner of the eye to the ear 
canal was horizontal; it was fixed in position by 
means of a headrest. The rod was mounted on 
' a horizontal board which, in turn, was fixed to 
the armrests of the rotating chair. 

The kinesthetic adjustment of the rod to the 
vertical was carried out under five test condi- 
tions: (a) Control: absence of rotation (C); (b) 
Acceleration in a clockwise direction (A-CW); 
(c) Deceleration from constant clockwise rota- 
tion (D-CW)); (d) Acceleration in a counterclock- 
wise direction (A-CCW); (¢) Deceleration from 
constant counterclockwise rotation (D-CCW). 

The sequence of test conditions was: C; 
A-CW; D-CW; A-CCW; D-CCW. Between 
each of these conditions a 3-min. rest period was 
interposed. The sequence was repeated making 
a total of ten trials per S. In contradistinction 
to the two previous experiments only one starting 
position was used: the rod was set at objective 
vertical at the beginning of each trial. Six male 
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Ss were employed. The data were analyzed by 
a two-way classification (conditions, Ss) analysis 
of variance. 

Measures employed.—In all three experiments 
the position in which the rod was kinesthetically 
perceived as vertical (apparent vertical) was 
measured in degrees of deviation from the objec- 
tive vertical (plumb line). Arbitrarily, an ang- 
ular position—as viewed by S—clockwise of 
objective vertical is designated as “+” (tilted 
right), and counterclockwise as “—” (tilted left). 
The measure used to represent S’s performance 
is the algebraic mean of the adjustments for each 
condition. 


RESULTS 


Experiment I: Sidewise head tilt.— 
The results of the effect of head tilt 
are summarized in Tables 1 and 2. 
For both men and women, the F test 
for the main experimental variable, 
head tilt, is significant beyond the .01 
level of confidence. The mean posi- 
tion of the apparent vertical is rotated 
to the left of control (head upright) 
when the head is tilted right, and to 
the right of control when the head is 
tilted left. The means of apparent 
vertical under the three test conditions 
differ significantly from one another 
(p < .01) for both men and women. 

Experiment II: Sidewise body tilt.— 
In general, the effect of body tilt is 
similar to that of head tilt, i.e., when 


TABLE 2 


ANALYsIs OF VARIANCE oF KinestHeTIc ApjusTMENT OF Rop To VERTICALITY 
unDER Heap Titt (Exp. I) 














Women Men 
Source of Variance df 
mei? | + | Sesh 7 |e 
Head position (H) 2 4449.40 | 74.21* 1 6185.36 | 33.34¢ | <.01 
Starting position (S) 2 206.79 3.60 | > 21.47 1.494 | >.05 
Individuals (I) 11 663.63 2.00° >.05 532.30 | <1.00° | >.05 
HxS + 115.05 3.804 <.01 39.53 1.374 | >.05 
HxXxI 22 664.37 3.994 <.01 2040.86 12.884 | <.01 
SsxI- 22 212.32 1.284 82.75 | <1.00°| >.05 
HxSsxI 44 332.51 316.92 
Total 107 6644.07 9219.19 


























* Tested against pooled error (H XK I and H X S). 
> Tested against H x S 
© Tested against H xX I. 
4 Tested against H XS 


XL 
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TABLE 3 


ANALYSIS OF VARIANCE OF KINESTHETIC 
ADJUSTMENT OF Rop To VERTICALITY 
uNDER Bopy Titt (Exp. II) 











P Sum of 

Source of Variance df Squares F P 
Body position (B) 2 {1786.57 | 19.91" | <.01 
Starting position (S) 2 21.46 1.24> | >.05 
Individuals (I) 20 | 369.56 | <1.00* | >.05 
Bxs 34.64 2.81¢ 05 
B XI 40 |1794.54 | 14.56° | <.01 
S xI 40 | 153.65 1.25¢ 
BXSxI 80 | 246.25 

Total 188 |4406.67 

















* Tested against B X I 
> Tested against B X S. 
Bxs 


© Tested against xL 


the body is tilted to the right, the 
mean position of the apparent vertical 
is rotated to the left (—2.9°) of control 
(1.6°), and when the body is tilted to 
the left, the mean position of the appar- 
ent vertical is to the right (4.6°) of 
control. The positions of the appar- 
ent vertical under all three test condi- 
tions differ significantly from each 
other (p < .01). The analysis of var- 
iance is summarized in Table 3. 
Experiment III: Body rotation.— 
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When S is rotated around his own 
vertical axis there are significant effects 
of labyrinthian stimulation on the 
kinesthetic perception of verticality. 
These effects are summarized in Table 
4. Under control conditions (no rota- 
tion) the apparent vertical is closest 
to objective vertical (—.5°). Under 
CW acceleration (A-CW) the appar- 
ent vertical is tilted to the left (—2.6°) 
of control; under CCW acceleration 
(A-CCW) the apparent vertical is 
tilted to the right (.8°) of control; 
under deceleration from CW rotation 
(D—CW) the apparent vertical is tilted 
to the right (1.3°) of control; and under 
deceleration from CCW rotation (D- 
CCW) the apparent vertical is tilted 
to left (—3.2°) of control. Thus the 
apparent vertical is shifted opposite 
the direction of acceleration, i.e., when 
acceleration is to the right, the tilt of 
the rod in the fronto-parallel plane is 
to the left, and vice versa.2, Moreover 


2 As mentioned in (2), this statement is dynam- 
ically but not geometrically correct. That is, 
CW and CCW with reference to rotation deal 


TABLE 4 


KinestHetic ADJUSTMENT OF Rop To VERTICALITY UNDER Bopy Roration (Exp. III) 














F tests 
Source of Variance df Sum of Squares F 
Conditions (C) 4 95.34 20.03*** 
Individuals (I) 5 13.97 2.34 
Error (C X I) 20 23.80 
Total 29 133.11 











t tests (Mean differences) 








Conditions Mean A-CW D-CW A-CCW D-CCW 

Control (C) —.5 2.1% 1.8** 1.3* 2.6*** 
Acceleration CW (A-CW) —2.6 3.990 3.4*** 5 
Deceleration from CW rotation 

(D-CW) +1.3 5 4.4*** 
Acceleration CCW (A-CCW) +.8 3.9°e* 
Deceleration from CCW rotation 

(D-CCW) —3.1 




















* Approaches significance (p = 


06). 
** Significant at or below the .05 level of confidence. 
** Significant at or below the .01 level of confidence. 
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deceleration from rotation in one direc- 
tion functions like acceleration in the 
opposite direction. 

The ¢ tests in Table 4 show that the 
mean position of apparent vertical 
without stimulation (control) is sig- 
nificantly different from the apparent 
vertical under each of the experimental 
test conditions. Acceleration in one 
direction differs significantly from ac- 
celeration in the other direction; this 
holds also for deceleration. Acceler- 
ation in one direction differs signifi- 
cantly from deceleration in the opposite 
direction. However, in both cases 
acceleration in one direction does not 
differ significantly from deceleration 
from rotation in the other direction. 


Discussion 


The main aim of this study was to 
inquire into the generality of the 
effects of extraneous stimulation on 
object perception which have been pre- 
viously demonstrated on the visually 
perceived vertical (1, 2, 4, 6). In the 
previous studies it has been found that 
there is a specific effect of extraneous 
stimulation when asymmetrically ap- 
plied. If an extraneous stimulus (e.g., 
auditory, labyrinthian, etc.) is applied 
to one side of the body, a visual line 
appears vertical when rotated away 
from the side stimulated. Related 
results were obtained with stimulation 
ensuing from body tilt: when the body 
is tilted slightly to one side, the posi- 
tion of the apparent vertical (visual) is 
tilted to the other side. 

Verticality is a spatial property of 
an object, which can be experienced 
by kinesthetic as well as by visual 
cues. One may, therefore, utilize this 
with motion around the vertical axis of the oder, 
whereas left and right deals with rotation in the 
fronto-parallel plane. 


3 The p value (.06) for the difference between 
A-CCW and control approaches significance. 


property to find out whether the effects 
of extraneous stimulation and body 
tilt are specific to visual object percep- 
tion or whether they are of a more 
general nature. The significance of 
the present experiments is the demon- 
stration that for kinesthetic perception 
of the vertical results have been ob- 
tained essentially identical in nature 
with those previously reported for vis- 
ual perception of the vertical. That 
is, when S is blindfolded, the kines- 
thetically perceived apparent vertical 
is in a position tilted to the left when 
the head is tilted to the right, when the 
body is tilted to the right, and when 
the body is accelerated to the right or 
decelerated to the left. Analogous 
relations hold for the opposite test 
conditions. 

These results suggest that interpre- 
tations which do not depend on the 
specific characteristics of one or the 
other modality are more satisfactory 
than those which take into account 
only a specific modality—visual or 
kinesthetic but not both. 

Such specific interpretations of per- 
ceptual change are, for instance, those 
which deal with retinal displacements 
subsequent to compensatory torsion of 
the eye, compensatory eye movements, 
nystagmus, etc. Wedonot deny that 
ocular-muscular changes involved in 
eye-movements, nystagmus, etc., are 
of significance. On the contrary, by 
their very nature as part of the organ- 
ismic state, they must be considered. 
However, there is a difference whether 
the interpretation is formulated in 
terms of eye movements as ocular- 
muscular events or in terms of eye 
movements as conditions for retinal 
displacements. 

In contradistinction to specific vis- 
ual theories our own interpretation, 
because of its organismic nature, is 
general enough to take care of results 
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A. Extraneous Stimulation 


Left Head Tilt 

Left Body Tilt 

Left Auditory 

Left Neck Muscle 
CCW Rotation (Accel.) 


B.Extraneous Stimulation 


Right Head Tilt 

Right Body Tilt 

Right Auditory 

Right Neck Muscle 
CW Rotation ( Accel.) 


Fic. 1. Hypothetical schema: Relation between position of apparent vertical and body 
equilibrium under various types of extraneous stimulation 


irrespective of the sense modality in- 
volved in the perception. 

In particular, the sensory-tonic field 
theory of perception fulfills the gener- 
ality requirement because the same 
hypothetical mechanism is applicable 
for experimental results on kinesthetic 
as well as on visual perception of ver- 
ticality. This mechanism is schemat- 
ically presented in Fig. 1. There are 
two assumptions: (a) A line is per- 
ceived as vertical when its position in 
space is in tune with the existing body- 
equilibrial state. (b) Asymmetrically 
applied extraneous stimulation includ- 
ing body tilt shifts the body-equilibrial 
axis away from the side of stimulation. 
With extraneous stimulation the body- 
equilibrial state, represented by the 
equilibrial axis, is that depicted in 
Fig. 1A and 1B; there the position of 
the apparent vertical must be dis- 


placed in the direction opposite the 
side of stimulation to become com- 
mensurate with the changed body- 
equilibrial axis.‘ 

Finally, it should be emphasized 
that the present experiment provides 
new evidence for the validity of the 
construct of “functional equivalence.” 
Labyrinthian stimulation and body 
tilt are factors affecting verticality in 
an equivalent way, independent of 
whether the affected object is per- 
ceived kinesthetically or visually. 


SUMMARY 


The present experiment was carried 
out to study the effect of head tilt, 
body tilt, and concentric body rota- 
tion on the kinesthetic perception of 


‘The mechanism hypothesized here has been 
discussed in (4) and will be discussed in greater 
detail in a forthcoming theoretical paper (5). 
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verticality. The S, while blindfolded, 
had the task of adjusting a rod pivoted 
at the lower end, to the vertical by 
moving the fingers of both hands along 
the rod. 

The results are essentially identical 
with those previously reported for vis- 
ual perception of the vertical. That 
is, the position of the rod that appeared 
vertical was objectively tilted to the 
left when: (a) the head was tilted 
right; (b) the body was tilted right; (c) 
the body was accelerated around the 
vertical axis in the clockwise direc- 
tion; (d) the body was decelerated 
from constant counterclockwise rota- 
tion. For the opposite test conditions 
the respective displacements were in a 
direction opposite to those mentioned 
above. 

The significance of this study con- 
sists in the demonstration that the 
effects of extraneous stimulation and 
body tilt previously described for vis- 
ual perception of verticality can be 
generalized to include kinesthetic per- 
ception of verticality. The results, 
thus, broaden the interpretations given 


within the sensory-tonic field theory 
of perception. 


(Received for priority publication 
April 10, 1952) 
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THRESHOLDS OF THE HUMAN EYE FOR ELECTRIC 


STIMULATION BY DIFFERENT WAVE FORMS! 


J. W. GEBHARD 
Applied Physics Laboratory, The Johns Hopkins University 


Much progress in visual science has 
been made by measuring the electric 
response of the eye to a light stimulus 
(9, 10). Less deeply explored has 
been the possibility of revealing some 
of the mechanisms of vision by the 
reverse process. This is to use an 
electric input and to measure the 
response in terms of the sensation of 
light such a stimulus calls forth. In- 
vestigators have shown sporadic inter- 
est in this matter ever since LeRoy 
(15) discovered in 1755 that an electric 
pulse to the eye will easily produce a 
bluish-white sensation of light. Visual 
perceptions caused by inadequate 
stimuli are called phosphenes and the 
electrical phosphenes have been stud- 
ied in a variety of respects for many 
years. Most of the early work was 
qualitative, but since 1900 measure- 
ments have frequently appeared. In 
general, there has been poor agreement 
among the findings reported and the 
history of inconsistency continues into 
the research of the present. 

A new and perhaps very important 
turn has been given this work by the 
studies of Motokawa and his associ- 
ates (1, 8, 16, 17,25). These workers 
have been ingenious in their use of the 
threshold of an electrical phosphene 
as a means of analyzing the effect of 
photic stimulation on the activity of 
the visual organ. Ina formidable out- 


1 This research was done while the author was 
on the staff of The Johns Hopkins University 
Institute for Cooperative Research, Baltimore, 
Md. It was carried out under Contract N5-ori- 
166, Task Order I, between the Special Devices 
Center, Office of Naval Research, and The Johns 
Hopkins University. This is Report No. 166-I- 
145, Project Designation No. NR-784-001 under 
that contract. 


put of some 40 papers in the last four 
years, they have touched upon the 
electrophysiology of brightness and 
color discrimination, color deficiency, 
contrast, spatial summation, inhibi- 
tion, visual illusions, and flicker effects 
(8). This work is of considerable 
interest and is a fresh approach to the 
physiological basis of vision. Never- 
theless, in spite of the promising na- 
ture of the published results, it is much 
too early to characterize them firmly. 
One can only say now that a great 
deal of verification must be done be- 
fore Motokawa’s findings are wholly 
accepted. 

The main concern of this paper is 
with a problem in electrostimulation 
in which certain results of Motokawa 
disagree with those of other workers. 
The problem is: Does the stimulus 
strength-frequency relation have one 
or several minima? This relationship 
is found by measuring the electrical 
thresholds of just noticeable flicker 
phosphenes for different frequencies of 
intermittent electric input. It has 
long been known from work on other 
tissues that there are optimum fre- 
quencies of excitation at which thresh- 
olds are lowest (13, 14). When thresh- 
old current J is plotted against log 
frequency n, U-shaped curves result 
that are symmetrical about the opti- 
mum frequency (5). The theoretical 
importance of the J — log n curves has 
been discussed by Hill (11), Coppée 
(6), Katz (12), Schaefer (19), and 
others. 

For the human eye, Rohracher (18) 
showed in 1935 that the strength- 
frequency curve as determined by sine 
waves passed through a distinct mini- 
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Fic. 1. Strength-frequency (J — log n) curves 
showing a single optimum frequency (after 
Rohracher and Schwarz) 


mum at about 20 cy./sec. for the light- 
adapted condition. Schwarz (20, 21, 
22, 23, 24), in a series of careful and 
varied experiments between 1936 and 
1947, verified and extended Rohr- 
acher’s finding of only one frequency 
of maximum stimulus effectiveness for 
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Fic. 2. Strength-frequency curves showing several optimum frequencies. 


light- and dark-adapted conditions, respectively. 


the light-adapted eye. Figure 1 
shows the data of these investigators. 

On the other side of the argument, 
Barnett and Posner (3), Motokawa 
and Iwama (17), and Abe (1) found 
the relationship to have several mini- 
ma for the light-adapted eye. Barnett 
and Posner used sine waves and re- 
ported a minimum at 22 cy./sec. for a 
“bright” condition of light adaptation 
and at 7 cy./sec. for a “dim” condi- 
tion. Motokawa and Iwama, who 
used both sine waves and rectangular 
pulses, reported minima at 18, 37, and 
53 cy./sec.? Abe also worked with rec- 
tangular pulses and claimed minima 
at 7, 20, 35, 54, and 74ccy./sec. Figure 
2 shows results of this kind. The data 
of Fig. 1 and 2 are clearly not in 
agreement. 

A second concern of this paper is 


? In another place Motokawa (16) also reported 
data on two minima while mentioning three and 
four. 
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with how the stimulus effectiveness of 
an intermittent electric input to the 
eye is affected by polarity and wave 
form. The problems of stimulus effec- 
tiveness and the strength-frequency 
relationship were studied at the same 
time in experiments conducted in 1948 
before seeing the papers from Mo- 
tokawa’s laboratory (7). This is un- 
fortunate with respect to the first 
problem in that insufficient frequencies 
were chosen above 25 cy./sec. to eval- 
uate adequately the question of mini- 
ma in this region. 


METHOD 


Stimulators.—Two electronic stimulators were 
used: a conventional sine wave oscillator and a 
rectangular pulse generator specially constructed 
for safe use on the eye (2). The second gener- 
ator provided anodal and cathodal pulses, or a 
train of pulses alternating in polarity. These 
were of controllable repetition rate, duration, and 
amplitude. Anodal or cathodal square waves 
were produced by adjusting the required polarity 
to a duty cycle (D/T) of .5. A “symmetrical” 
square wave was formed by alternating positive 
and negative pulses of the same duration. 

Measurements.—A nonpolarizable silver elec- 
trode, 17 X 17 mm. in size, was fixed to the 
glabella equidistant between the eyes with elec- 
trode paste. The other contact was made to the 
entire right hand. Application of the active 
electrode by scraping the skin and cleaning with 
acetone resulted in a DC resistance due to S of 
about 800 to 1000 2. Threshold measurements 
were made by means of an oscillograph across a 
high resistance in series with the subject. The 
values were read in peak current and equated for 
R.M.S. current. 

Procedure —Four adult Ss were used. An 
examination by an ophthalmologist before and 
after electro-stimulation revealed no retinal de- 
fects. Since visual adaptation has been shown 
to influence thresholds for electrically induced 
sensations, the Ss’ eyes were kept adapted to a 
constant level of light while measurements were 
taken. The S was seated in a dark experimental 
room with his eyes open and as close as possible 
to the center of a 36 X 36-in. diffusing screen 
illuminated from the back by white light of 
3600° K. This provided a reasonably homogene- 
ous adapting field at two levels of luminance: .1 
and 10 millilamberts. At S’s left hand was a 
knob that he used to control the current passing 
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through his body. His instructions were to adjust 
the knob until a condition of just noticeable flicker 
was perceived at either the center or periphery of 
the visual field as directed. He was allowed to 
bracket the threshold flicker region, taking as 
much time as desired, until satisfied with his 
judgment. The EZ arranged the frequency, wave- 
form, and duty-cycle settings from equipment in 
another room while S was switched out of the 
circuit. 

Conditions.—Four wave forms were used in 
determining the flicker thresholds, These were 
a sine wave, a positive pulse with a duty cycle of 
.5, a negative pulse with duty cycle of .5, and a 
“symmetrical” square wave made up of alter- 
nating positive and negative pulses of equal 
duration. Thresholds were obtained at 12 fre- 
quencies for each wave form except the sine 
wave. These were at 5, 7.5, 10, 12.5, 15, 17.5, 
20, 22.5, 25, 35, 45, and 60 cy./sec. For the 
sine wave, the frequencies at 5 and 7.5 cy./sec. 
were omitted because of limitations of the oscil- 
lator. The frequencies were presented in a ran- 
dom order until an equal number of measure- 
ments had been obtained at each frequency. 
The data are means based on either five or ten 
thresholds as indicated in the next section. Most 
of the thresholds were obtained for flicker ob- 
served in the central 5° of the visual field at the 
adapting luminance of 10 millilamberts. For 
purposes of comparison, however, some data 
were taken with flicker observed at the extreme 
periphery, and for an adapting luminance of 
-1 millilambert. 


RESULTS 


Strength-frequency relation—Table 
1 shows the mean threshold values in 
R.M.S. current for wave forms, fre- 
quencies, and Ss. Each value is based 
on five determinations. Figure 3 
shows the results for all wave forms 
when the threshold values of Ss are 
averaged. Individual curves were also 
drawn but are not shown. Inspection 
of the table and the individual curves 
drawn from it reveals that one mini- 
mum threshold, very nearly at 20 cy./ 
sec., is conspicuous between the fre- 
quency values of 17.5 and 22.5 cy./sec. 
There is evidence also, however, in 
certain of the data that very little 
change in threshold occurs between 
some of the individual points meas- 
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ured between 7.5 and 22.5 cy./sec. which these variations are “minima” 
There are several instances of no_ is questionable and is discussed below. 
change and two cases, both for Sub- Stimulus effectiveness —Data on the 
ject C, of reversals. The extent to problem of wave-form effectiveness 


TABLE 1 


Errect or Potariry anD WavE Form on TuresHoLps or Just NorTiceaBLe FLIcker 
OBSERVED IN THE CENTER OF THE FIELD 
Values are in mA equated for R.M.S. current. The luminance of 
the adapting field was 10 mL. 








Symmetrical Square Wave 





Frequency in cy./sec. 






































Subject 
5 7.5 10 12.5 15 17.5 20 22.5 25 35 45 60 
A 84 76 66 50 | .34 24 .28 32 36 40 82 | 1.74 
B 36 26 24 16 | .16 08 08 12 14 20 26 68 
90 48 38 38 | .40 30 | .30 36 40 60 90 | 1.48 
D 78 | 62 54 | 48 | 38 | .34 ‘se tr ee 66 | 1.08 
Average | .72 52 46 38 32 .24 .24 .28 32 44 66 | 1.24 








Anodal Square Wave 





Frequency in cy./sec. 








Subject 
5 7.5 10 12.5 15 17.5 20 22.5 25 35 45 60 
A 73 71 68 49 37 .27 a a. ae 42 83 | 1.01 
B 29 | .19 13 10 | 09 | .05 05 06 | .08 12 18 .34 
Cc .78 42 40 | 41 39 31 .28 .34 35 57 85 | 1.04 
D 56 45 Al 36 | 31 33 31 39 40 46 .57 .78 
Average | .59 44 40 34 .29 .24 .24 30 30 39 61 79 






































Cathodal Square Wave 





Frequency in cy./sec. 








Subject 
5 7.5 10 12.5 15 17.5 20 22.5 25 35 45 60 
A 80 59 49 40 | 30 | .22 | .20 26 30 34 64 95 
B 21 14 13 10 | 07 | 07 | .06 06 08 13 18 45 
Cc 75 45 42 41 | .41 32 | 31 31 36 62 78 98 
D cLeeiae 22 8f 2 OF £2 £2 ee ae 61 
Average | .54 | .38 | .32 ae | we | ae | i 25 34 | 48 75 






































Sine Wave 





Frequency in cy./sec. 




















Subject 
5 75 10 12.5 15 17.5 20 22.5 25 35 45 60 
A 80 59 i ae | ah iva 27 34 61 | 1.14 
B 18 13 11 | .10 | .06 | .09 11 14 22 67 
Cc 33 28 Pr 2 oe. 27 34 66 95 
D BMiSBiDisBsigaiaiA#i#ais 73 
Average 45 wo | wn 24 | .18 24 | .25 31 52 .88 
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Fic. 3. J — log n curves for different wave forms 
(from Table 1) 


makes it apparent that there is no 
difference between one polarity or 
wave form and another when the 
measures are equated for R.M.S. cur- 
rent. Square waves yield lower thresh- 
olds than sine waves by an amount 
predictable from the total AC power. 
The result that AC power controlled 
stimulus effectiveness was checked by 
separate measurements on the effect 
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Fic. 4. Stimulus effectiveness related to the duty 
cycle of rectangular pulses (from Table 2) 
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of duty cycle. The upper section of 
Table 2 shows complete measurements 
taken on the anodal pulse at 30 cy./ 
sec. Each value is the mean of ten 
measurements. Figure 4 gives the 
data in graphic form. Both anodal 
and cathodal pulses were further 
measured with a frequency of 35 cy./ 


TABLE 2 


Errect or Duty CycLte on THE StimuLus 
EFFECTIVENESS OF ANODAL AND CATHODAL 
RECTANGULAR PULSES 


Thresholds for just noticeable flicker in mA. 
The luminance of the adapting field was 10 mL. 








Anodal Pulse at 30 cy./sec. 














Duty Cycle 
Subject 
1 a 3 A SS 6 a 8 
A {1.12} .64 | .48 | .43 | .41 | .43 | .51 | .63 
B -32| .20 | .16 | .12 | .10 | .11 | .13 | .16 
C_ | 1.33) .67 | .60 | .46 | .43 | .46 | .48 | .62 


























Anodal Pulse at 35 cy./sec. 











Duty Cycle 
Subject ; 
3 8 7 
A A7 35 40 
B 12 10 13 
50 Al 45 





Cathodal Pulse at 35 cy./sec 











Duty Cycle 
Subject 
3 A 7 
A 56 43 52 
B ll .08 10 
os 58 52 68 














sec. at duty cycles of .3, .5, and .7. 
These results are also based on ten 
determinations and are given in the 
rest of the table. The data clearly 
show that pulses are most effective 
when the duty cycle is .5; i.e., when 
the pulse becomes a square wave and 
the total AC power is maximum. 
These findings indicate that the effec- 
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tive stimulus for the eye of an inter- 
mittent electric input is alternating 
current and that the stimulus effec- 
tiveness of any wave form is dependent 
upon the total power of the AC 
component. 


Discussion 


The general result of these measure- 
ments is that the visual mechanism 
shows only one frequency of electric 
input at which the threshold for flicker 
is conspicuously a minimum. This 
finding is in satisfactory agreement 
with the results of Rohracher and of 
Schwarz. Itis alsoin agreement with 
certain theoretical considerations of 
nerve response to alternating currents 
proposed by Hill, Coppée, and Schaefer 
that predict symmetrical, U-shaped 
I — log n curves. It appears not to 
agree with Barnett and Posner, Motok- 
awa and Iwama, or Abe, all of whom 
have found several well-defined minima 
in the strength-frequency curve. There 
is, nevertheless, a suggestion that the 
I — log ncurves obtained in this study 
are not as symmetrical as might be 
desired. However, the irregularities 
in the curves, as measured in milli- 
amperes, are very small. They reflect, 
if anything, the small number of 
measurements taken at each frequency 
rather than the presence of true 
minima. 

Barnett and Posner did not report the extent 
of their data and merely indicated that three 
Ss were used for the “bright” adaptation level 
and two for the “dim.” Neither did they state 
where, in the visual field, flicker was judged. 
This, taken with a rather crude means of con- 
trolling adapting luminance, makes their pub- 
lished curves difficult toevaluate. Their “bright” 
minimum at 22 cy./sec. corresponds fairly well 
with data from Rohracher and Schwarz. The 
“dim” minimum at 7 cy./sec. agrees only with 
Abe’s finding. 

Motokawa and Iwama measured voltage 
rather than current. This is perhaps open to the 
objection that variations of resistance due to 
changes in the tissues, sweating, loosening of the 


electrodes and the like, may unduly affect volt- 
age readings at threshold. In addition, their 
means of controlling adaptation was similar to 
that of Barnett and Posner. No indication is 
given of the amount of data used in constructing 
their curves and it may be noted that their 
results on sine waves (Fig. 2) show two quite 
different values of threshold plotted at the fre- 
quency of 30 cy./sec. In this figure they have 
carried the limb of one curve through one value 
and that of the adjacent curve through the other 
value. Unless this is an error, there is some ques- 
tion of the justification for this. Finally, the 
authors objected that Rohracher and Schwarz 
missed two additional minima because of an 
insufficient number of frequencies measured. 
But Schwarz (22, 23) measured the same number 
of points between 10 and 55 cy./sec. as did 
Motokawa and Iwama. They were not the same 
points, although they were equally well dispersed 
along the frequency range. 

Abe’s method of achieving a light-adapted eye 
was to pre-illuminate with 24,000 lux (meter 
candles) for an unspecified length of time and to 
measure thresholds after 1.5 min. had elapsed. 
This means that he was working in an early stage 
of dark adaptation rather than in a condition of 
maintained light adaptation. Again, voltage 
was measured using about the same technique as 
that of Motokawa and Iwama. It is interesting 
to note, however, that Abe’s thresholds were only 
about one-tenth as large. No details were given 
as to the amount of data obtained. More, it is 
difficult to see in Abe’s graphs for the light- 
adapted eye (L) a clear indication of the five 
minima he reports. Abe’s points appear to be 
fitted reasonably well by a smooth curve with a 
single minimum at 20 cy./sec. His curve for the 
dark-adapted eye (D) which shows a high thresh- 
old at 20 cy./sec. is completely at variance with 
evidence from Schwarz (24). Abe attributed the 
absence of the 20 cy./sec. minimum to the low 
electrical excitability of the rods in the dark- 
adapted condition. This hypothesis was studied 
in some detail by Tukahara and Abe (25) who 
presented data to show that rod sensitivity to 
electric stimulation at 20 cy./sec. disappeared 
when light was removed from the retina. 


As pointed out earlier, the fre- 
quencies above 25 cy./sec. measured 
in the present study were too far apart 
to disclaim the presence of additional 
minima beyond this frequency. It 
can only be noted that the threshold 
values at 25, 35, 45, and 60 cy./sec. 
are reasonably well fitted by a smooth 
curve. Schwarz, at different times, 








has measured 14 to 18 frequencies dis- 
tributed between 25 and 110 cy./sec. 
and his values are likewise satisfac- 
torily fitted by a smooth curve. 

There are many reasons why marked 
variation can result in the measures 
of thresholds such as these. Among 
the obvious difficulties are those 
related to intra-S variability and the 
technical problems of consistent elec- 
trostimulation and measurement. In 
addition, there are two other sources 
of difficulty. The first results from 
the fact that the electrical thresholds 
depend upon the adaptation level of 
the eyes. Theeffect of adapting lumi- 
nance is seen in Fig. 5 where threshold 
measures obtained at 10 and .1 milli- 
lamberts are compared. It is appar- 
ent that any inadvertent change in 
adaptation level will produce a shift 
in the thresholds over much of the 
frequency range. This same effect 
has been shown in the 1947 data of 
Schwarz (24). 

The second source of potential trou- 
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ble lies in controlling where, in the 
visual field, S looks for the flicker. If 
his judgments are made in the central 
5° of the field, the thresholds will be 
higher than if judgments are made 
elsewhere. A comparison of the cen- 
ter and the extreme periphery is made 
in Fig. 6. The reason for this differ- 
ence is not clear. It cannot be con- 
cluded from these measures that the 
thresholds at the center and the pe- 
riphery differ because of regional differ- 
ences in the visual mechanism. The 
method of controlling adapting lumi- 
nance used in this study resulted in 
illuminating the periphery of the ret- 
ina more weakly than the center. 
This would tend to reduce the thresh- 
olds at the periphery, assuming an 
equal electrical excitability of all areas 
of the eyes. However, Bourguignon 
and Déjean (4), using the entirely dif- 
ferent method of stimulating the eye 
with condenser discharges and meas- 
uring chronaxies, found that periph- 
eral phosphenes were easier to arouse 
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Fic. 6. Effect of position in visual field where flicker is judged on the J — log n curve 


than central ones. They reported 
chronaxies of 1-2 msec. for peripher- 
ally localized sensations and 2-3 msec. 
for those in the center. The signifi- 
cant fact for this discussion is that if 
S judged the flicker first at one place, 
then at another in the visual field, 
varying values of stimulus intensity 
would be found. Inadequate control 
of both adaptation level and of where 
flicker is judged, therefore, could con- 
ceivably produce “minima” where 
none exist. 


SUMMARY 


Threshold intensities of the inter- 
mittent electric excitation required for 
Ss to experience just noticeable flicker 
were obtained for light-adapted eyes 
with sine waves, anodal and cathodal 
square waves and rectangular pulses. 


1. Polarity was found to be unim- 
portant. Anodal or cathodal stimuli 
were equally effective in producing an 
electrically aroused phosphene. 

2. Square waves gave lower thresh- 
olds than sine waves by an amount 
predictable from the total AC power. 

3. Rectangular pulses were most 
effective when the duty cycle was .5; 
i.e., when the pulse became a square 
wave and the total AC power was 
maximum. These results indicate that 
the effective intermittent electric stim- 
ulus for the eye is alternating current 
and that the efficiency of any wave 
form is dependent upon the total 
power of the AC component. 

4. Thresholds for all wave forms 
were lowest at a frequency of about 
20 cy./sec. This frequency was the 
only one at which an unequivocal min- 
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imum was observed and is a result that 
is in agreement with Rohracher and 


Schwarz. 


It is not in agreement with 


Barnett and Posner, Motokawa and 
Iwama, and Abe, all of whom have 
found several minima in the strength- 
frequency relation. 


10. 





. Ase, Z. 
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